Gene Cag_1441 details


Gene Information

Locus tag: Cag_1441
Symbol:
ID: 3746640
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 1908495
End bp: 1909598
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 34%
IMG OID: 637773976
Product: hypothetical protein
Protein accession: YP_379741
Protein GI: 78189403
COG category: [S] Function unknown
COG ID: [COG3177] Uncharacterized conserved protein
TIGRFAM ID:
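
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the reported gene length includes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no amino acid.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(1908495, 1909598)
    print(length, protein_length(length))
```

With the Start bp and End bp values from this record, the sketch gives 1104 bp and 367 aa, matching the Gene Length and Protein Length fields.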


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTAGCTT TTATCCATCA AAAAACCAAT TGGCCTTATT TCACTTGGAA CAATGATGAA 
ATAGTTAATG CGCTGAGTGA AGCAAGAAAT TTGCAAGGAA GAGTCATTGG TAAAATGGAA
TCTTTAGGAT TCGACCTAAG AAATGAAGCT CTACTTGACA CGTTAACACT TGATGTATTA
AAATCGTCAG AAATAGAAGG AGAATATTTA AATCCTGAAC AGGTTCGTTC CTCAATTGCC
CGTAGATTGG GAATGGAAAT TGCCGGTTCT GTTGAGTCGG ATAGAAATGT TGATGGCGTA
GTCGAAATGA TGTTGGATGC AACACAAAAT TGCTTTAAAC CATTAACAGT TGAAAGACTC
TTCGATTGGC ATGCAGCATT ATTCCCGACT GGAAGAAGTG GAATGCTCAA AATTACAGTC
GGCGATTGGC GAAAAGATAC GACAGGTCCA ATGCAAGTTG TGTCGGGAGC CTTAGGAAAG
GAAAAAGTGC ATTTTCAAGC TCCCGATTCG ATAGTTGTTG AAAAAGAGAT GAATCAGTTT
TTAGAGTGGA TTAATAATAA TGTAAAAATT GATTTAGTCA TTAAAGCTGC TATAGCTCAC
TTATGGTTTG TTACCATCCA TCCATTTGAA GATGGAAATG GTAGGATAAC AAGAGCGTTG
ACCGATATGT TATTGGCACA ATCGGATAAT AGCAATCAGC GTTTTTATAG TATGTCTGCA
CAAATCAGAA TTGAAAGAAA GCAATATTAT GACATACTGG AAAAGACACA AAAAGGGAAC
CTTGATATAA CAGAATGGAT TCAGTGGTTT TTAAACTGCC TTATTAATGC TTTAAAATCA
ACTGATGCTA CATTATTTAA CGTTTTATTA AAAGCAAACT TCTGGAGTAA ACATTCTAAA
ACATTGATAA ATGAAAGACA GAAGAAACTT TTAAATAAAT TATTAGATGG ATTTGATGGA
AAAATAACAT CATCAAAATG GGCAAAGATT GCAAAATGCT CAAAAGACAC TGCCATAAGA
GATATAAATG ATTTGATAGA AAAAAATATT CTACAAAAAG AAGCAGGAGG AGGAAGAAGT
ACAAATTATG AATTAAAGAT ATGA
 
Protein sequence
MVAFIHQKTN WPYFTWNNDE IVNALSEARN LQGRVIGKME SLGFDLRNEA LLDTLTLDVL 
KSSEIEGEYL NPEQVRSSIA RRLGMEIAGS VESDRNVDGV VEMMLDATQN CFKPLTVERL
FDWHAALFPT GRSGMLKITV GDWRKDTTGP MQVVSGALGK EKVHFQAPDS IVVEKEMNQF
LEWINNNVKI DLVIKAAIAH LWFVTIHPFE DGNGRITRAL TDMLLAQSDN SNQRFYSMSA
QIRIERKQYY DILEKTQKGN LDITEWIQWF LNCLINALKS TDATLFNVLL KANFWSKHSK
TLINERQKKL LNKLLDGFDG KITSSKWAKI AKCSKDTAIR DINDLIEKNI LQKEAGGGRS
TNYELKI
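
The relationship between the gene sequence and the protein sequence can be checked with a short translation sketch. It assumes the amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code (the two tables differ only in the permitted start codons). The helper name `translate` is hypothetical, and only the first six and last three 10-base groups of the gene sequence are used here.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments are
# those of the standard code, which translation table 11 shares.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: translation ends here
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    first_60 = "ATGGTAGCTT TTATCCATCA AAAAACCAAT TGGCCTTATT TCACTTGGAA CAATGATGAA"
    last_24 = "ACAAATTATG AATTAAAGAT ATGA"
    print(translate(first_60))
    print(translate(last_24))
```

The two fragments translate to MVAFIHQKTNWPYFTWNNDE and TNYELKI, which agree with the start and end of the protein sequence listed above.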