Gene Jann_2558 details


Gene Information

Locus tag: Jann_2558
Symbol:
ID: 3935021
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Jannaschia sp. CCS1
Kingdom: Bacteria
Replicon accession: NC_007802
Strand:
Start bp: 2557863
End bp: 2560784
Gene Length: 2922 bp
Protein Length: 973 aa
Translation table: 11
GC content: 63%
IMG OID: 637904922
Product: formate dehydrogenase, alpha subunit
Protein accession: YP_510500
Protein GI: 89055049
COG category: [R] General function prediction only
COG ID: [COG3383] Uncharacterized anaerobic dehydrogenase
TIGRFAM ID: [TIGR01591] formate dehydrogenase, alpha subunit, archaeal-type
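
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based, inclusive coordinates (the record does not state its convention) and an unspliced CDS that ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Start bp and End bp copied from this record.
length_bp = gene_length(2557863, 2560784)
print(length_bp, protein_length(length_bp))
```

Under these assumptions the results agree with the listed gene length (2922 bp) and protein length (973 aa).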


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.779669
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGATT TCATTATTCC CTGGGATGAT CGGGACATGG GGACGAAGGC CGTTGACGGC 
GCGCCGGTGA CCCTGACTGT GGACGGATTT GAGGTGACGG TGCCCGAGGG CACGAGTGTC
ATGCGGGCGG CGTCGGAGAT TGGGATGCAG ATCCCGAAGC TTTGCGCGTC GGACAATCTG
GAGGCGTTTG GATCCTGCCG GTTATGTGTG GTGGAGATCG AGGGGCGGCG CGGCACGCCC
GCGTCCTGCA CCACGCCCGT GGCCGAGGGT ATGGTCGTGC GCACCCAGTC CAGCAAGGTG
AAGAAGATCC GTCGCGGCGT GATGGAGCTG TATATCTCCG ACCACCCGCT CGATTGCCTG
ACCTGTGCGG CCAATGGCGA TTGCGAATTG CAGGACATGG CGGGGGCCGT GGGCCTGCGC
GATGTGCGGT ATGAGGCGCC GGAGAACGCG TCGCTGGCGA ACCATTTCGA GGCCCGCAAA
GGCGACCAAC CGAACCCGGA ATGGATCCCC AAGGACGACA GCAATCCGTA CTTCACCTAC
GATCCCGCCA AATGCATCGC GTGTTCACGC TGCGTCCGGG CTTGCGAAGA GGTGCAGGGG
ACCTTTGCCC TGACCATGGA AGGACGCGGG TTCGATAGCC GCATTTCCGC CGGGGGGCCG
GACAGTGATT TCCTGACCTC CGATTGTGTA TCCTGCGGGG CCTGTGTGCA GGCCTGCCCG
ACGGCGACGT TGCAGGAGAA ATCCGTCATT GAATTGGGCA CGCCCGAGCG CTCCGTCGTG
ACGACCTGCG CCTATTGCGG CGTGGGGTGT TCGTTCAAGG CGGAATTGAA CGGCGACGAA
TTGGTGCGGA TGACGCCCTA CAAGCACGGG GAGGCGAACC GCGGCCATTC CTGCGTGAAG
GGGCGGTTCG CTTACGGTTA TGCCAACCAT TCCGACCGCA TCCTCAACCC GATGATCCGC
GACAGCATTG ATCAGCCTTG GAAAGAGGTG TCCTGGGAGG AGGCGATTGG CTTTGCCGCC
GACCGGATGC GCGGTTTGCA GGAAAAGCAC GGGCGCAAGA GCATCGGCGT GATCACCTCG
TCGCGCTGCA CGAATGAAGA GACCTATCTT GTGCAGAAGC TGGCGCGCGG CGTCTTCATG
AACAACAACA CCGACACCTG CGCGCGGGTC TGCCATTCGC CCACCGGCTA CGGTTTGGGT
CAGACGTTCG GCACCAGTGC GGGCACGCAG AACTTCGATA GCGTTGAACA GGTGGACGTC
GCCATCATCA TCGGCGCGAA CCCGACGGAC GCGCATCCGG TCTTTGGCAG CCGCCTGAAA
AAGCGCGTGC GGGCAGGGGC TAAGCTGATC GTGATCGACC CGCGCAAGAC CGATGTGGTG
CGCTCAGCCC ATATCGAGGC GGCGCATCAT TTGCCGCTGC GCCCGGGCAC CAATGTGGCC
GTGGTCACCG CGATGGCCCA TGTGATCGTG GATGAGAAGA TCTATGATGA GCAATTCATC
CGCGAGCGGT GTGACTGGAA TGAGTTCGAG GAATACGCGG AATTCGTCCG GGATGTCCGC
CATTCCCCCG AAATGACGGA GATGCTGACC GGCGTCCCGG CGGAGGAGCT GCGCCGCGCC
GCCCGCCTCT ATGCCACCGG TGGCAATGGT GCCATTTATT ACGGCCTCGG CGTGACCGAG
CATTCCCAGG GCTCCACCAC GGTCATGGGC ATCGCCAACC TCGCGATGAT GACCGGCAAT
ATCGGGCGCG AAGGCGTTGG CGTGAACCCG CTGCGGGGCC AGAACAACGT GCAGGGCTCT
TGCGATATGG GCTCTTTCCC GCACGAATTG CCCGGGTATC GCCATGTGAA AGGCGATGAG
GTCCGCGCGC TGTTCGAGAG CAAATGGGGG GTGGAGATTG ACCCGGAGCC CGGCCTGCGC
ATCCCCAATA TGCTCGACAG CGCCGTCGGC GGCACCTTCA AGGGCCTGTA TTGTCAGGGC
GAGGACATCC TGCAATCGGA CCCCGACACA CACCACGTCG CGGCGGGCCT CGCAGCCATG
GAATGCGTCA TCGTCCACGA TCTGTTCCTC AACGAGACGG CGAATTACGC CCACGTTTTC
CTGCCCGGAT CGACGTTCCT GGAGAAGGAC GGCACCTTCA CCAATGCCGA GCGGCGCATC
AACCGCGTGC GCGAGGTGAT GAAACCCAAG AACGGCTATG CCGATTGGGA GGTGACGCAG
CTGCTGGCGA AGGCCATGGG CGCGGACTGG CATTATACCC ATCCGTCCCA GATCATGGAT
GAAATCGCGG AGACGACGCC CGGCTTCGCC AACGTCAACT ATGCGATGCT GGAAGAGCGT
GGCAGCGTGC AGTGGCCCTG CAATGACGCG GCCCCCGACG GGTCCCCGAT CATGCATATC
GATGGTTTTG TGCGCGGCAA GGGGCGGTTC ATCGTGACCG AATATATCGC CACCGAAGAA
CGCACCGGGC CGCGCTTCCC GCTATTGCTG ACGACGGGGC GCATCCTGTC GCAGTATAAC
GTGGGCGCGC AGACGCGGCG GACCGAAAAC GTGGTCTGGC ACGCCGAGGA TGTGCTGGAA
ATCCACCCCC ATGACGCCGA AGTGCGCGGC GTGAAAGACG GCGATTGGGT CAAGCTGGCC
TCGCGCACCG GAGAGACGTC CCTGCGCGCC ACGATCACCG ACAAGGTCGT GCCGGGGGTC
GTCTACACCA CCTTCCACCA CCCCGATACG CAGGCCAATG TCATCACCAC GGATCATTCC
GACTGGGCGA CGAACTGCCC GGAATACAAG GTGACGGCAG TGCAGGTGGG GCTGTCCAAC
GGCCCGACAG AATGGCAGCG CGCGTATAAC GCGCAGGCCG AAAAGTCGCG GCGCATCCAA
TCAAGTGACG GCGTGCGTGG GCGCGCGACG GCGGCGGAGT GA
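
The gene sequence is printed in space-separated blocks of ten bases, so it has to be cleaned before any calculation. The following is a minimal sketch of how a GC fraction could be computed from such text. The helper names are hypothetical, and the demo input is only the first two blocks of the sequence, not the full gene.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two blocks of the gene sequence, used only as a short demo input.
demo = clean_sequence("ATGAAGGATT TCATTATTCC")
print(len(demo), gc_fraction(demo))
```

Applied to the full gene sequence, the result can be compared with the GC content field of this record.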
 
Protein sequence
MKDFIIPWDD RDMGTKAVDG APVTLTVDGF EVTVPEGTSV MRAASEIGMQ IPKLCASDNL 
EAFGSCRLCV VEIEGRRGTP ASCTTPVAEG MVVRTQSSKV KKIRRGVMEL YISDHPLDCL
TCAANGDCEL QDMAGAVGLR DVRYEAPENA SLANHFEARK GDQPNPEWIP KDDSNPYFTY
DPAKCIACSR CVRACEEVQG TFALTMEGRG FDSRISAGGP DSDFLTSDCV SCGACVQACP
TATLQEKSVI ELGTPERSVV TTCAYCGVGC SFKAELNGDE LVRMTPYKHG EANRGHSCVK
GRFAYGYANH SDRILNPMIR DSIDQPWKEV SWEEAIGFAA DRMRGLQEKH GRKSIGVITS
SRCTNEETYL VQKLARGVFM NNNTDTCARV CHSPTGYGLG QTFGTSAGTQ NFDSVEQVDV
AIIIGANPTD AHPVFGSRLK KRVRAGAKLI VIDPRKTDVV RSAHIEAAHH LPLRPGTNVA
VVTAMAHVIV DEKIYDEQFI RERCDWNEFE EYAEFVRDVR HSPEMTEMLT GVPAEELRRA
ARLYATGGNG AIYYGLGVTE HSQGSTTVMG IANLAMMTGN IGREGVGVNP LRGQNNVQGS
CDMGSFPHEL PGYRHVKGDE VRALFESKWG VEIDPEPGLR IPNMLDSAVG GTFKGLYCQG
EDILQSDPDT HHVAAGLAAM ECVIVHDLFL NETANYAHVF LPGSTFLEKD GTFTNAERRI
NRVREVMKPK NGYADWEVTQ LLAKAMGADW HYTHPSQIMD EIAETTPGFA NVNYAMLEER
GSVQWPCNDA APDGSPIMHI DGFVRGKGRF IVTEYIATEE RTGPRFPLLL TTGRILSQYN
VGAQTRRTEN VVWHAEDVLE IHPHDAEVRG VKDGDWVKLA SRTGETSLRA TITDKVVPGV
VYTTFHHPDT QANVITTDHS DWATNCPEYK VTAVQVGLSN GPTEWQRAYN AQAEKSRRIQ
SSDGVRGRAT AAE
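
The protein sequence corresponds to the gene sequence translated under translation table 11. A minimal sketch of that translation follows. It builds the codon table from the standard amino-acid assignment string, which table 11 shares with the standard code for elongation codons; table 11 differs in its permitted start codons, which this sketch does not model. All names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (first, second, third base).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAAGGATT TCATTATTCC CTGGGATGAT"))  # prints MKDFIIPWDD
```

The output matches the first ten residues of the protein sequence listed in this record.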