Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cag_2004 |
Symbol | |
ID | 3747114 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium chlorochromatii CaD3 |
Kingdom | Bacteria |
Replicon accession | NC_007514 |
Strand | - |
Start bp | 2540686 |
End bp | 2541981 |
Gene Length | 1296 bp |
Protein Length | 431 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 637774541 |
Product | sulfide dehydrogenase, flavoprotein subunit |
Protein accession | YP_380295 |
Protein GI | 78189957 |
COG category | [R] General function prediction only |
COG ID | [COG0446] Uncharacterized NAD(FAD)-dependent dehydrogenases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.141136 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTAATG GATTATCACG TAGGGATTTT AACAAGCTGC TGTTGTCGGG TGTTGCTGGT TCAACGATTG GTTTATTTGG CAACTCGGGA ACTCTGTTTG GTGCAACCAG CAAGCGGGTT GTGGTAATTG GTGGTGGCTT TGGTGGTGCC TCAGCCGCAA AATATCTTCG TAAACTTGAC CCAACCATTC AAGTAACGTT GGTTGAACCA AAAAGCGTTT ATCACACCTG TCCATTTAGC AACTGGGTGC TTAGTGGACT AAAGAATATG GAAGATATTG CCCACTTTTA CGATGTTCTT AGAAATCGCT ACAAAGTGAA CGTTATTGCT GATACTGCCG TCAGCATTGA TGCTGACAAG AGCAGTGTTA CGCTACAAAC AGGCAAAACT TTATATTTCG ATCGTTTAAT TGTAGCTCCC GGTATTGACT TTAAATATGA TTCCGTTCAA GGTTATAGTG AAAATGTTGC TAATTCGGTA ATGCCTCATG CGTGGCAAGC TGGTCCGCAA ACAATTTTGT TGCACAAACA ACTACAAGCC ATGCCAAATG GTGGTAAAGT ATTTATTAGC GCCCCTGCAA ACCCCTTCCG TTGCCCACCA GGACCTTATG AGCGTGCCAG CTTAATTGCA CGTTACTTAA AAGAGCAAAA GCCACTATCC AAAGTTATTA TTTTTGATGC TAAAGAGAGC TTCTCAAAGC AAGGGCTCTT TAAGCAAGCT TGGGAACGCC TTTATCCCGG CATGATTGAG TGGCGTGCCT CCACTATGGG CGGTAAAGTG GTATCGGTTG ATGCTGCAAC CATGACGGTT ACCACTGAGT TTGGTGCCGA AAAGGGAGAT GTTATCAATA TTATTCCTGC CCAAAAAGCA GGTAAAATTG CGGTTGATGC TGGGCTTACC GATGCTTCAG GCTGGTGCCC GATTAATCCC ATCTCCTTTG AGTCAACCTT GCATCCTGGC ATTCACGTTA TTGGTGATGC TGCTATTGCT GGCGCTATGC CAAAGTCAGG CTTTGCGGCA AGTAGCCAAG GTAAGGTTGC CGCAGCGGCA ATTGTGCGCC TCTTCCAAGG CAAGGTTCCT GCACCACCTT CACTTGTTAA CACCTGCTAT AGTTTAATTG ATAAGAACTA TGCTATATCG GTTGCTGGTG TTTATAAACT TGCAATGACG GGTATTGTAG AAATTAAAGG TTCAGGCGGC TTAACACCAA TGAATGCTGA TGCCGATCAG CTTGAGCAAG AGGCAATGTT TGCCCAAGGC TGGTACGATA ATATTTCCCA AGACGTTTGG GGATAA
|
Protein sequence | MSNGLSRRDF NKLLLSGVAG STIGLFGNSG TLFGATSKRV VVIGGGFGGA SAAKYLRKLD PTIQVTLVEP KSVYHTCPFS NWVLSGLKNM EDIAHFYDVL RNRYKVNVIA DTAVSIDADK SSVTLQTGKT LYFDRLIVAP GIDFKYDSVQ GYSENVANSV MPHAWQAGPQ TILLHKQLQA MPNGGKVFIS APANPFRCPP GPYERASLIA RYLKEQKPLS KVIIFDAKES FSKQGLFKQA WERLYPGMIE WRASTMGGKV VSVDAATMTV TTEFGAEKGD VINIIPAQKA GKIAVDAGLT DASGWCPINP ISFESTLHPG IHVIGDAAIA GAMPKSGFAA SSQGKVAAAA IVRLFQGKVP APPSLVNTCY SLIDKNYAIS VAGVYKLAMT GIVEIKGSGG LTPMNADADQ LEQEAMFAQG WYDNISQDVW G
|
| |