Gene Cphamn1_2056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_2056 
Symbol 
ID6375749 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp2219728 
End bp2221101 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content53% 
IMG OID642684547 
ProductAldehyde Dehydrogenase 
Protein accessionYP_001960447 
Protein GI189500977 
COG category[C] Energy production and conversion 
COG ID[COG1012] NAD-dependent aldehyde dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0119311 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGTTA CCCTGAACCC CGCTAACGAA GAAGTGCTTG CCGAGTATCC GGTCATGACT 
TCTGCAGAGA TTGACAGGAT TCTTGAAGCT TCGGAGAACG CCGCCCTTAT CTGGAAAAAA
ATCCCGATCG ATGAGCGAAA AATCGCGATG CATCGTCTTG CCGATCTGCT GAGGGAGCAA
AAAGAGATGC ACGGGGCGAT GATCAGCCGT GAGATGGGCA AACGCTATGC CGAGTCGGTC
GCCGAGGTTG AAAAATGCGC ATGGGTCTGT GATTATTACG CCGAACACGC GGAGGCCTTC
CTGCAGCCTG AAAAGGTTGA TATGGATGGC GGAGCCGGAC TTGTGACCTT TGTCCCGCTT
GGTGTCGTTC TCGGGGTCAT GCCCTGGAAT TTTCCTTTCT GGCAGGTGAT TCGTTTTGCG
GCTGCGGTTA TGATGGCAGG GAACGGTGTT GTCATCAAGC ACGCTCCCAA CGTGACCGGA
TCGGCGATCG CGCTGGAAAA CCTTTTTCGT GAAGCCGGTT TTCCCGTGAA CCTGTACAGG
ACTTTGCATA TAGATCTTGA AGATGTTGAT CGCATGGTCG GCCACATCAT CGCTCATCCG
GTGATCAAGG CTGTTTCGGT TACAGGCAGT ACCGGTGCGG GAGTTGCCGT GGCGTCTAAA
GCAGGCAGTG CGCTCAAGAG AAGTGTTCTT GAACTGGGAG GTAATGATCC CTATCTGGTG
CTCGATGATG CTGATCTTGA TGAGGCCGTA GGGTTCTGTA TCGCGTCCCG GCTTTTGAAC
GCGGGTCAGA GTTGTATCGC CGCCAAGCGT TTTGTCGTTC ACCGTTCTGT TACATCACGT
TTCGAGCAAA AGCTGCTCGA TACAATGAGC AAAAAGAAAG TCGGGGATCC TTTTGATCCC
GGCATACACA TAGGGCCGAT AGCGAGGAAA GATCTCAGAG ACGCTCTCCA CCTTCAGGTG
GAGCAGAGCA GAGGGCTCGG GGCAAAGGTC CTTTGCGGAG GCGAGATTCC TGACAGAAAA
GGCTTTTTTT ATCCGCCGAC GATTGTTACG GATGTCTCTG CGGATATGGC GGTCTATAGT
GAAGAGACAT TCGGGCCGGT CGCGACGATT CTTGAAGCAC GGGATGACGA TGATGCCGTC
AGGATCGCCA ATGACAGCCC TTTCGGTCTT GGATCAGCGG TTTTTTCCGG TGATCCTGAC
CGCGCCAGAC GGGTCGCCGC CAGGCTGGAT GCCGGAAACT GCTGTATCAA TTCGATGGTA
AAGTCAGACC CTCGTCTGCC TTTTGGCGGG ATTAAACAGT CAGGTTACGG CCGTGAACTT
TCCAGCTACG GTATTCGGGA GTTCGTCAAT ATCAAATCGA TCTATATCGC TTAG
 
Protein sequence
MIVTLNPANE EVLAEYPVMT SAEIDRILEA SENAALIWKK IPIDERKIAM HRLADLLREQ 
KEMHGAMISR EMGKRYAESV AEVEKCAWVC DYYAEHAEAF LQPEKVDMDG GAGLVTFVPL
GVVLGVMPWN FPFWQVIRFA AAVMMAGNGV VIKHAPNVTG SAIALENLFR EAGFPVNLYR
TLHIDLEDVD RMVGHIIAHP VIKAVSVTGS TGAGVAVASK AGSALKRSVL ELGGNDPYLV
LDDADLDEAV GFCIASRLLN AGQSCIAAKR FVVHRSVTSR FEQKLLDTMS KKKVGDPFDP
GIHIGPIARK DLRDALHLQV EQSRGLGAKV LCGGEIPDRK GFFYPPTIVT DVSADMAVYS
EETFGPVATI LEARDDDDAV RIANDSPFGL GSAVFSGDPD RARRVAARLD AGNCCINSMV
KSDPRLPFGG IKQSGYGREL SSYGIREFVN IKSIYIA