Gene Cag_1381 details


Gene Information

Locus tag: Cag_1381
Symbol:
ID: 3746565
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1845111
End bp: 1847096
Gene Length: 1986 bp
Protein Length: 661 aa
Translation table: 11
GC content: 49%
IMG OID: 637773917
Product: alpha-amylase family protein
Protein accession: YP_379682
Protein GI: 78189344
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
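
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and rests on two assumptions that the record does not state: that Start bp and End bp are 1-based inclusive coordinates, and that Gene Length includes the terminal stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1845111
end_bp = 1847096
reported_gene_length = 1986      # bp
reported_protein_length = 661    # aa


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one, assuming the CDS ends with a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


gene_length = span_length(start_bp, end_bp)
protein_length = expected_protein_length(gene_length)

print(gene_length, protein_length)  # prints: 1986 661
```

Under those assumptions, both computed values agree with the reported Gene Length and Protein Length.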


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTACTT CTCCCAACAA CGCTCCCTGT AAAGCCACCT GCACTGTGCA AACACCTCAT 
CTTGATTTAC TTCTTCAAAC ACTTGAAAGC CTGACCCCAA AGCAACCTGC GCAACCTGCG
CCCCCTGCAT TAAGTCAGCA GCCTTACCGC GTGCCTACGT TATGGCAAAG CGACAACGTA
GGTGTTGAGG TGATTGAGCC TGCCGAGTAT TACGCTCGCT GCATTCGCTC GTTGCTTGAC
TCAGCGCACA AAGCGCAACC GCAAGATATT GATGGTGATT GGAAGCGCCA TGCCGTTGTT
TATAATCTTT TTATTCGCTT AACCACAGCC TTTGACCATA ATGGCGATGG TGAACTCAGT
AATGAGCCGA TGGAGTGCGG TTTTCGAGAA ACGGGTACCT TGCTGAAAGC TATTGCACTA
CTCCCTTATA TTGAACGACT TGGCGCTAAC ACGCTCTACC TACTCCCCCT TACTGCTATT
GGAGAACAAA ACCGTAAAGG CTCGCTTGGC TCACCCTACG CTGTAAAAAA TCCCCGCACG
CTTGACCCCA TGCTTGCCGA ACCTGCGCTT GGCTTATCGG CGGAAATATT ACTTAAAGCT
TTTGTGGAAG CCGCCCATTT GCGCAACATG CGTGTGGTCT TTGAATTTGT GTTTCGCACA
ACGGCAGTGG ATAGCGATTG GGTTCGAGAA CATCCTGAAT GGTTTTATTG GCTCAAAGAG
AGTGAAGCGA ATGAAACCTT TGGTGCTCCG TACTTTGAGC AAGAGCGGCT GAACGCCATT
TATGCAGCCG TTGACCGTAA CGATATGAAT AACCTGCCTG CACCCGACGA GGCATATCGT
GCAAAGTTTG TGGCACCACC ACAGCAGGTG GTAGAGCGTG ATGGCGCATT GGTTGGTATG
CAAAAGGATG GCACAACCTG CCGCATAGCA AGCGCATTTT CCGATTGGCC ACCCGATGAT
CGCCAACCAG CTTGGAGCGA TGTTACCTAC CTGAAAATGC ACCAGCACGA GGGCTTTAAC
TACATGGCGT ACAACACCAT ACGCATGTAC GACAGCACTC TTGAACGTCC TGAATATCGC
ACCACAAATT TGTGGGAAAC GCTTATCTCT ATTATTCCCG AATTTCAGGA TGAGTATCAC
ATTGATGGAG CCATGATTGA TATGGGGCAC GCCTTGCCTG CGCCGTTAAA GCAAACCATT
GTTGAACGTG CTCGCGCTAA ACGCCCCGAT TTTGCTTTTT GGGATGAAAA CTTCAATCCG
TCGGTTGAAA TTCGTGAACA TGGCTTTGAT GCCGTTTTTG GCTCTCTGCC GTTTGTAGTG
CACGACATTA TTTTTATTCG CGGTTTACTT AACCACTTAA ACCGTATTGG TGTTGCCCTT
CCCTTTTTTG GAACGGGCGA AAACCACAAC ACGCCGCGCG TTTGCTTTCG CTACCCCAAA
CAAGCCGCGG GACGTAGCTT AGCCACCTTT ATTTTCACCC TGAGCGCCAT TCTCCCCTCA
CTTCCTTTTT TACAATCGGG AATGGAATTG TGTGAATGGC ATCCCGTGAA TTTAGGGTTG
AACTTTACCG ATGAAGATCG CGCCACGTAT CCGTCCGAAA CATTACCACT CTTTAGCCCA
CGCGCTTACG ATTGGGAAAA AAGCAACAAT CTTGAACCTC TCAACCACTA CATTAAGCGC
CTACTCACCG TGCGCGAACG CTATCTTGAT GTAGTGCTAT GCGGAGATGC TGGCTCAATT
GGCGTACCGT ATGTAAGCCA TCCCGAACTT TTTGCCGTGA TGCGTAGCGC AGGTGGAAAA
TCGTTGCTCT TTGTGGGCAA TAGCAACCTT ACCGAAAGCC GCACAGGCAT GCTTGAGTTT
AGTGTGGAAC AAGCTTCGTT AGTGGAGTTA ATTTCCGAGC GTCCCTATAC AATTACAAAT
CACCGTTTAG AAGTAAGCTG CACGGCTGGC GAATGTTTAC TTTTTGAAAT TCCCTCTTTT
TCTTAA
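
The GC content field in this record can in principle be recomputed from the gene sequence above. The following is a minimal sketch, assuming GC content is defined as the percentage of G and C among the A, C, G and T bases; how the database rounds the value is not stated in the record. The helper name `gc_content` is hypothetical, and only the first line of the sequence is used here as sample input.

```python
def gc_content(seq: str) -> float:
    """Percentage of G/C among A/C/G/T bases; spaces and newlines are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence, in the 10-base blocks used by this record.
first_line = (
    "ATGAGTACTT CTCCCAACAA CGCTCCCTGT "
    "AAAGCCACCT GCACTGTGCA AACACCTCAT"
)

print(round(gc_content(first_line), 1))
```

To compare against the reported figure, the full 1986 bp sequence would be passed in place of `first_line`.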
 
Protein sequence
MSTSPNNAPC KATCTVQTPH LDLLLQTLES LTPKQPAQPA PPALSQQPYR VPTLWQSDNV 
GVEVIEPAEY YARCIRSLLD SAHKAQPQDI DGDWKRHAVV YNLFIRLTTA FDHNGDGELS
NEPMECGFRE TGTLLKAIAL LPYIERLGAN TLYLLPLTAI GEQNRKGSLG SPYAVKNPRT
LDPMLAEPAL GLSAEILLKA FVEAAHLRNM RVVFEFVFRT TAVDSDWVRE HPEWFYWLKE
SEANETFGAP YFEQERLNAI YAAVDRNDMN NLPAPDEAYR AKFVAPPQQV VERDGALVGM
QKDGTTCRIA SAFSDWPPDD RQPAWSDVTY LKMHQHEGFN YMAYNTIRMY DSTLERPEYR
TTNLWETLIS IIPEFQDEYH IDGAMIDMGH ALPAPLKQTI VERARAKRPD FAFWDENFNP
SVEIREHGFD AVFGSLPFVV HDIIFIRGLL NHLNRIGVAL PFFGTGENHN TPRVCFRYPK
QAAGRSLATF IFTLSAILPS LPFLQSGMEL CEWHPVNLGL NFTDEDRATY PSETLPLFSP
RAYDWEKSNN LEPLNHYIKR LLTVRERYLD VVLCGDAGSI GVPYVSHPEL FAVMRSAGGK
SLLFVGNSNL TESRTGMLEF SVEQASLVEL ISERPYTITN HRLEVSCTAG ECLLFEIPSF
S
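
The relationship between the gene sequence and the protein sequence can be illustrated with a small translator. This is a sketch, not the database's own method: it builds the codon table from the compact NCBI amino-acid string, which for internal codons is the same in translation table 11 as in the standard table (the two differ only in the permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G; third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGTACTT CTCCCAACAA CGCTCCCTGT"))  # prints: MSTSPNNAPC
```

The result matches the first ten residues of the protein sequence listed above; translating the full gene sequence the same way would be expected to stop at the terminal TAA codon.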