Gene Cag_0194 details


Gene Information

Locus tag: Cag_0194
Symbol:
ID: 3746681
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 220889
End bp: 222049
Gene Length: 1161 bp
Protein Length: 386 aa
Translation table: 11
GC content: 47%
IMG OID: 637772721
Product: hypothetical protein
Protein accession: YP_378515
Protein GI: 78188177
COG category: [S] Function unknown
COG ID: [COG3876] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
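
The coordinate and length fields above are mutually consistent, which the following minimal sketch illustrates. It assumes that the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming it ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(220889, 222049)
print(length)                     # 1161
print(protein_length_aa(length))  # 386
```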


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATTGTTA CGGGTCTTGA CGTATTATTG CGTAACCTTG ATATGTTACG CCACCGTTCG 
GTAGGGTTAC TTGTGAACCA AACCTCACTT ACTGCCTCAA TGGAATATTC ATGGCAGCTT
TTGCAAAAGC AAGGCATCAC CATACGGCGC ATCTTTTCAC CTGAGCATGG CTTGTTTGCT
ACCGAGCAAG ATCAAATTGC GGTAAGCTAT CAGCCTGAAC TTGGTTGCGA TATGGTAAGT
CTTTACGGCG ATTCCGCTGC AACGTTGGTG CCCGATATGG CGTTGTTGGA TGATCTTGAT
GTGGTGATTT TTGATATTCA AGATGTTGGG GCGCGTTATT ACACCTACGT AAATACTTTA
GCGCTCTTTA TGGAGGCAAT TGCAGGGCGC GATATTGAGC TGATGGTGCT TGATCGTCCG
AATCCTCTCG GTGGAGAAAT TGTGGAAGGT CCAATGCTCG ATATGGCATT TGGTTCCTTC
GTGGGCGTTT TTCCCGTACC TGTTCGCCAT GCGTTAACGG CTGGTGAATT AGCTGTGCTT
TATCGTGATG TTATGCAGCT TGATGTTAAT CTACGCATTA TCAAAATGGA GGGATGGAAG
CGCACTATGC TGTATGGTGA AACAGGTTTG CCTTGGATTC CACCTTCGCC CAATATGCCT
ACGGTTGCTA CGGCTGAAGT CTATCCCGGC ATGTGTTTGT TTGAGGGATT AAATGTGTCG
GAAGGGCGAG GCACTACCAC ACCATTTCAA CTCTCAGGGG CACCATTTAT CCATCCCATT
GAACTTGCTG AACGCTGCCA CTCCTATGGA TTGGAGGGTG TGCGCTTTCG TCCTGTCTGG
TTTAAACCAA CCTTCCATAA ATTTGCAGGT GAGGTAATTG GTGGCATTTG GCAGCAAGTA
ACCGATGCGC GACGTTATCG CTCATTTGCA ACGGCAGTTG CTATGACGGC AGCGCTTCGA
GAGCTTTATG GCGAACAAGT AACCTTTTTA CGTGGTGTTT ATGAATTTAA CGATACCATT
CCTGCCTTCG ACCTTTTAGC TGGTAACGCC ACTATTCGCA CAGCCATTGA GAGCGGCAAC
ACTATCCATA CTCTTCTCAC CTTATGGCAA AAGGATGAAG CACAATTTGC CGAAACTAAA
ACTCGCTATC ACCTCTATTA A
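
A GC content figure such as the 47% reported in the gene information can be recomputed from a sequence listed in 10-base blocks like the one above. The sketch below is one way to do that; the helper name `gc_content` is hypothetical, and the short input string is a toy example rather than the gene itself.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Toy input: 6 of the 8 bases are G or C.
print(gc_content("ATGC GGCC"))  # 0.75
```

Passing the full gene sequence as a single string would give a fraction that can be compared with the reported percentage.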
 
Protein sequence
MIVTGLDVLL RNLDMLRHRS VGLLVNQTSL TASMEYSWQL LQKQGITIRR IFSPEHGLFA 
TEQDQIAVSY QPELGCDMVS LYGDSAATLV PDMALLDDLD VVIFDIQDVG ARYYTYVNTL
ALFMEAIAGR DIELMVLDRP NPLGGEIVEG PMLDMAFGSF VGVFPVPVRH ALTAGELAVL
YRDVMQLDVN LRIIKMEGWK RTMLYGETGL PWIPPSPNMP TVATAEVYPG MCLFEGLNVS
EGRGTTTPFQ LSGAPFIHPI ELAERCHSYG LEGVRFRPVW FKPTFHKFAG EVIGGIWQQV
TDARRYRSFA TAVAMTAALR ELYGEQVTFL RGVYEFNDTI PAFDLLAGNA TIRTAIESGN
TIHTLLTLWQ KDEAQFAETK TRYHLY
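
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below assumes the codon-to-amino-acid assignments of that table, which match the standard genetic code (the tables differ in which codons may act as start codons). The function name `translate` is hypothetical, and only the first 30 bases and the final 21 bases of the gene are used as inputs.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]  # raises KeyError on non-ACGT symbols
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATTGTTA CGGGTCTTGA CGTATTATTG"))  # MIVTGLDVLL
print(translate("ACTCGCTATC ACCTCTATTA A"))           # TRYHLY
```

The two outputs match the first ten and the last six residues of the protein sequence listed above.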