Gene Cag_2004 details

Gene Information

Locus tag: Cag_2004
Symbol:
ID: 3747114
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium chlorochromatii CaD3
Kingdom: Bacteria
Replicon accession: NC_007514
Strand:
Start bp: 2540686
End bp: 2541981
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 45%
IMG OID: 637774541
Product: sulfide dehydrogenase, flavoprotein subunit
Protein accession: YP_380295
Protein GI: 78189957
COG category: [R] General function prediction only
COG ID: [COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases
TIGRFAM ID: [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence
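
The start, end, gene length and protein length fields are mutually consistent, which a little arithmetic can confirm. The sketch below assumes 1-based inclusive coordinates and a CDS length that includes one stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(2540686, 2541981)
print(length_bp)                  # 1296
print(protein_length(length_bp))  # 431
```

Both values match the Gene Length and Protein Length fields of this record.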


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.141136
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTAATG GATTATCACG TAGGGATTTT AACAAGCTGC TGTTGTCGGG TGTTGCTGGT 
TCAACGATTG GTTTATTTGG CAACTCGGGA ACTCTGTTTG GTGCAACCAG CAAGCGGGTT
GTGGTAATTG GTGGTGGCTT TGGTGGTGCC TCAGCCGCAA AATATCTTCG TAAACTTGAC
CCAACCATTC AAGTAACGTT GGTTGAACCA AAAAGCGTTT ATCACACCTG TCCATTTAGC
AACTGGGTGC TTAGTGGACT AAAGAATATG GAAGATATTG CCCACTTTTA CGATGTTCTT
AGAAATCGCT ACAAAGTGAA CGTTATTGCT GATACTGCCG TCAGCATTGA TGCTGACAAG
AGCAGTGTTA CGCTACAAAC AGGCAAAACT TTATATTTCG ATCGTTTAAT TGTAGCTCCC
GGTATTGACT TTAAATATGA TTCCGTTCAA GGTTATAGTG AAAATGTTGC TAATTCGGTA
ATGCCTCATG CGTGGCAAGC TGGTCCGCAA ACAATTTTGT TGCACAAACA ACTACAAGCC
ATGCCAAATG GTGGTAAAGT ATTTATTAGC GCCCCTGCAA ACCCCTTCCG TTGCCCACCA
GGACCTTATG AGCGTGCCAG CTTAATTGCA CGTTACTTAA AAGAGCAAAA GCCACTATCC
AAAGTTATTA TTTTTGATGC TAAAGAGAGC TTCTCAAAGC AAGGGCTCTT TAAGCAAGCT
TGGGAACGCC TTTATCCCGG CATGATTGAG TGGCGTGCCT CCACTATGGG CGGTAAAGTG
GTATCGGTTG ATGCTGCAAC CATGACGGTT ACCACTGAGT TTGGTGCCGA AAAGGGAGAT
GTTATCAATA TTATTCCTGC CCAAAAAGCA GGTAAAATTG CGGTTGATGC TGGGCTTACC
GATGCTTCAG GCTGGTGCCC GATTAATCCC ATCTCCTTTG AGTCAACCTT GCATCCTGGC
ATTCACGTTA TTGGTGATGC TGCTATTGCT GGCGCTATGC CAAAGTCAGG CTTTGCGGCA
AGTAGCCAAG GTAAGGTTGC CGCAGCGGCA ATTGTGCGCC TCTTCCAAGG CAAGGTTCCT
GCACCACCTT CACTTGTTAA CACCTGCTAT AGTTTAATTG ATAAGAACTA TGCTATATCG
GTTGCTGGTG TTTATAAACT TGCAATGACG GGTATTGTAG AAATTAAAGG TTCAGGCGGC
TTAACACCAA TGAATGCTGA TGCCGATCAG CTTGAGCAAG AGGCAATGTT TGCCCAAGGC
TGGTACGATA ATATTTCCCA AGACGTTTGG GGATAA
 
Protein sequence
MSNGLSRRDF NKLLLSGVAG STIGLFGNSG TLFGATSKRV VVIGGGFGGA SAAKYLRKLD 
PTIQVTLVEP KSVYHTCPFS NWVLSGLKNM EDIAHFYDVL RNRYKVNVIA DTAVSIDADK
SSVTLQTGKT LYFDRLIVAP GIDFKYDSVQ GYSENVANSV MPHAWQAGPQ TILLHKQLQA
MPNGGKVFIS APANPFRCPP GPYERASLIA RYLKEQKPLS KVIIFDAKES FSKQGLFKQA
WERLYPGMIE WRASTMGGKV VSVDAATMTV TTEFGAEKGD VINIIPAQKA GKIAVDAGLT
DASGWCPINP ISFESTLHPG IHVIGDAAIA GAMPKSGFAA SSQGKVAAAA IVRLFQGKVP
APPSLVNTCY SLIDKNYAIS VAGVYKLAMT GIVEIKGSGG LTPMNADADQ LEQEAMFAQG
WYDNISQDVW G
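
The gene and protein sequences can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch, not the database's own method. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code for non-initiator codons (the tables differ in permitted start codons, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate` and `gc_percent` are hypothetical.

```python
from itertools import product

# Codon table built in TCAG order (first base slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGAGTAATG GATTATCACG TAGGGATTTT AACAAGCTGC TGTTGTCGGG TGTTGCTGGT"
print(translate(first_line))  # MSNGLSRRDFNKLLLSGVAG
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.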