Gene Cag_1896 details

Gene Information

Locus tag: Cag_1896
Symbol:
ID: 3747641
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2409689
End bp: 2411413
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 49%
IMG OID: 637774433
Product: hypothetical protein
Protein accession: YP_380189
Protein GI: 78189851
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2931] RTX toxins and related Ca2+-binding proteins
TIGRFAM ID:
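
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2409689
END_BP = 2411413
GENE_LENGTH_BP = 1725
PROTEIN_LENGTH_AA = 574


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate span (assumed convention)."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming the last codon is a stop."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions, both comparisons hold for this record: the span covers 1725 bp, which is 575 codons, one of them the stop.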


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTATC CCTCATTATT TCCGTTTATC CTTAGTGCGG CAACAATGGT TGCAAGCACG 
CAATCCATAC AAGCTACCGA AACAAAGCGT CATGCTGATG CAAAGGAATT GCCTCAAGTA
ACAATACCTG ATCTCCCCTT TGCCACAGCA CCCGATGCCC CACAAGCGCT TGGGCTACGC
CAAAGCAGTA TTACAGGCAG CGCAGAGCAT GACTATATCA AACTCTATGG CAATGGAGGA
CAAGCGCGCC ATCCTGAGCC AGTGCGCAAT CCCTCGTTAA CACGCTCCAA CACCTTTAGC
TCTACCGTAG CAATTGAGTT GAATGGTACC ATTCTTGCAG GCACTGAAGC CGCAACTCCA
ACGGTGCGCT TTTTTATTAA TGGTAAAGAT GTGGGAGTGG CTACACTGAG CACTGAGCAA
AGTGCGTACA GTAAAAAAAC AGGGGGCGTT CCGCATAGCG ATTTACAACG CTTTACCTTT
CATGTTGATG AGCTTGCTAT TCGGGAAATC AAGTTGGTTG TGGAGTCAGC GCCCGTACCG
CAATCGGAGG TCTATATTCA TCGAGTAAAT ATTAATGCTG AAGTAAATCT TGACCAAAAT
CTTGAGGCGA ATGCTTTGCG AGGAGCGGCT GTGAATTTTG CTACTCCATC AGCCTTATGG
GAAGGGCGCA ATGGATATCA GTTGCCACAA GGCGCTATTC CCAGCGATGT GCGCTCGGTA
ACCATTGACA CCGCACTTTA TCAAACTACC CTACAACGCG CACCTGGTAC CCCATCCAAT
CCCATTATTG TGCAAGGTGG TGGTGGCGAA GATACCCTCT ACTTGCTTGG CTCCTCCGTG
CAATATTTAT TAGCCGTAGA TGAAGGTGGC ACCTTGGTGA TTGCGGAGTC GCAAGGGCTG
GATCAAAATG CTTTAGCAAC CAACATTGCT CGCCTTGAAT TTGCCGATGG CAGCTTTTTT
TTAGCGCCAC AAACCGTTGC AGGCAAAGCA ATAACGTTAG CGACAGGTAG TGAGGGGAGC
AAGCAGCTTC GTGCACAACT ACCAGCAGGC ATGGGCATTA CCGTGCGCCC ACTCAGCAGC
AGCGCTATGC AGGAACAGCT CCGTTCGTTA GTTGGCAAAA GCGTGCAACC AACGATTGAA
CAACGTTTAA GCGCGTTGTT TGTCAATCAG CTTGAACCTC AGCTTGTGCT ACGAGGTTTA
GATTTTGCAA CCTTGCCAAT GCTTCGCCAT GCGGCTGATG TGGAGCTGAA CGGTAGCACA
AAAGGCAAGC AATGGGTAAT GGTTGAAAGC AATGCCTTGC CCGATGGCGC TCATGTAACA
GTAAAGGATG TTGATGGTTT GTTGCTTTCG GGTAGCCGCG ATCTCTTAGT ACAAAGCAAA
GGGAAGTCGG TAACCATTGT GGCAGGCGAT GGCAATCAAG AGCTACGAAG TGAAGGAGGG
AATGATCTGC TTTTTGGTGG CAACGGTAAC GATCGCCTTT TTGCGGGCGC TGGCAACGAT
GAGCTTTGTG GGGGCAATGG CGATAACCTT CTTGATGGAG GTGCAGGAAG CGACGTTGCA
TTCTTTAGCG GCAATGTAGC CGAGTATCGC ATGACGCACA ATGCGGCTAC AAATATGACA
AGCGTGGTTG ATAGTATTCC AAATCGTGAT GGAAGCAATC AACTCATTAA TATAGAGCAA
CTTCGCTTTG CTGATCGCAC GGAACTAATT GCACAAGGGA AATAG
 
Protein sequence
MKYPSLFPFI LSAATMVAST QSIQATETKR HADAKELPQV TIPDLPFATA PDAPQALGLR 
QSSITGSAEH DYIKLYGNGG QARHPEPVRN PSLTRSNTFS STVAIELNGT ILAGTEAATP
TVRFFINGKD VGVATLSTEQ SAYSKKTGGV PHSDLQRFTF HVDELAIREI KLVVESAPVP
QSEVYIHRVN INAEVNLDQN LEANALRGAA VNFATPSALW EGRNGYQLPQ GAIPSDVRSV
TIDTALYQTT LQRAPGTPSN PIIVQGGGGE DTLYLLGSSV QYLLAVDEGG TLVIAESQGL
DQNALATNIA RLEFADGSFF LAPQTVAGKA ITLATGSEGS KQLRAQLPAG MGITVRPLSS
SAMQEQLRSL VGKSVQPTIE QRLSALFVNQ LEPQLVLRGL DFATLPMLRH AADVELNGST
KGKQWVMVES NALPDGAHVT VKDVDGLLLS GSRDLLVQSK GKSVTIVAGD GNQELRSEGG
NDLLFGGNGN DRLFAGAGND ELCGGNGDNL LDGGAGSDVA FFSGNVAEYR MTHNAATNMT
SVVDSIPNRD GSNQLINIEQ LRFADRTELI AQGK
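
The relationship between the gene sequence and the protein sequence can be sketched in a few lines of Python. This is an illustrative example, not the pipeline that produced this record: it assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the set of permitted start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are made up for this sketch.

```python
# Codon table built in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1, ignoring any trailing partial codon."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First and last rows of the gene sequence shown above.
    first_row = "ATGAAGTATC CCTCATTATT TCCGTTTATC CTTAGTGCGG CAACAATGGT TGCAAGCACG"
    last_row = "CTTCGCTTTG CTGATCGCAC GGAACTAATT GCACAAGGGA AATAG"
    print(translate(first_row))  # MKYPSLFPFILSAATMVAST
    print(translate(last_row))   # LRFADRTELIAQGK*
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow the protein sequence and the GC content field to be checked in the same way.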