Gene EcolC_2028 details

Gene Information

Locus tag: EcolC_2028
Symbol: pntB
ID: 6067913
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2240132
End bp: 2241520
Gene Length: 1389 bp
Protein Length: 462 aa
Translation table: 11
GC content: 53%
IMG OID: 641601440
Product: pyridine nucleotide transhydrogenase
Protein accession: YP_001724999
Protein GI: 170020045
COG category: [C] Energy production and conversion
COG ID: [COG1282] NAD/NADP transhydrogenase beta subunit
TIGRFAM ID:
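
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only; it assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the gene record.
START_BP = 2240132
END_BP = 2241520


def gene_length(start_bp, end_bp):
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count, assuming the gene length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, "bp")                   # 1389 bp
    print(protein_length(length), "aa")   # 462 aa
```

Under those assumptions, both results agree with the Gene Length and Protein Length fields.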


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.442796
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value: 0.870714
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCTGGAG GATTAGTTAC AGCTGCATAC ATTGTTGCCG CGATCCTGTT TATCTTCAGT 
CTGGCCGGTC TTTCGAAACA TGAAACGTCT CGCCAGGGTA ACAACTTCGG TATCGCCGGG
ATGGCGATTG CGTTAATCGC AACCATTTTT GGACCGGATA CGGGTAATGT TGGCTGGATC
TTGCTGGCGA TGGTCATTGG TGGGGCAATT GGTATCCGTC TGGCGAAGAA AGTTGAAATG
ACCGAAATGC CAGAACTGGT GGCGATCCTG CATAGCTTCG TGGGTCTGGC GGCAGTGCTG
GTTGGCTTTA ACAGCTATCT GCATCATGAC GCGGGAATGG CACCGATTCT GGTCAATATT
CACCTGACGG AAGTGTTCCT CGGTATCTTC ATCGGGGCGG TAACGTTCAC GGGTTCGGTG
GTGGCGTTCG GCAAACTGTG TGGCAAGATT TCGTCTAAAC CATTGATGCT GCCAAACCGT
CACAAAATGA ACCTGGCGGC TCTGGTCGTT TCCTTCCTGC TGCTGATTGT ATTTGTTCGC
ACGGACAGCG TCGGCCTGCA AGTGCTGGCA TTGCTGATAA TGACCGCAAT TGCGCTGGTA
TTCGGCTGGC ATTTAGTCGC CTCCATCGGT GGTGCAGATA TGCCAGTGGT GGTGTCGATG
CTGAACTCGT ACTCCGGCTG GGCGGCTGCG GCTGCGGGCT TTATGCTCAG CAACGACCTG
CTGATTGTGA CCGGTGCGCT GGTCGGTTCT TCGGGGGCTA TCCTTTCTTA CATTATGTGT
AAGGCGATGA ACCGTTCCTT TATCAGCGTT ATTGCGGGTG GTTTCGGCAC CGACGGCTCT
TCTACTGGCG ATGATCAGGA AGTGGGTGAG CACCGCGAAA TCACCGCAGA AGAGACAGCG
GAACTGCTGA AAAACTCCCA TTCAGTGATC ATTACTCCGG GGTACGGCAT GGCAGTCGCG
CAGGCGCAAT ATCCTGTCGC TGAAATTACT GAGAAATTGC GCGCTCGTGG TATTAATGTG
CGTTTCGGTA TCCACCCGGT CGCGGGGCGT TTGCCTGGAC ATATGAACGT ATTGCTGGCT
GAAGCAAAAG TACCGTATGA CATCGTGCTG GAAATGGACG AGATCAATGA TGACTTTGCT
GATACCGATA CCGTACTGGT GATTGGTGCT AACGATACGG TTAACCCGGC GGCGCAGGAT
GATCCGAAGA GTCCGATTGC TGGTATGCCT GTGCTGGAAG TGTGGAAAGC GCAGAACGTG
ATTGTCTTTA AACGTTCGAT GAACACTGGC TATGCTGGTG TGCAAAATCC GCTGTTCTTC
AAGGAAAACA CCCACATGCT GTTTGGTGAC GCCAAAGCCA GCGTGGATGC AATCCTGAAA
GCTCTGTAA
 
Protein sequence
MSGGLVTAAY IVAAILFIFS LAGLSKHETS RQGNNFGIAG MAIALIATIF GPDTGNVGWI 
LLAMVIGGAI GIRLAKKVEM TEMPELVAIL HSFVGLAAVL VGFNSYLHHD AGMAPILVNI
HLTEVFLGIF IGAVTFTGSV VAFGKLCGKI SSKPLMLPNR HKMNLAALVV SFLLLIVFVR
TDSVGLQVLA LLIMTAIALV FGWHLVASIG GADMPVVVSM LNSYSGWAAA AAGFMLSNDL
LIVTGALVGS SGAILSYIMC KAMNRSFISV IAGGFGTDGS STGDDQEVGE HREITAEETA
ELLKNSHSVI ITPGYGMAVA QAQYPVAEIT EKLRARGINV RFGIHPVAGR LPGHMNVLLA
EAKVPYDIVL EMDEINDDFA DTDTVLVIGA NDTVNPAAQD DPKSPIAGMP VLEVWKAQNV
IVFKRSMNTG YAGVQNPLFF KENTHMLFGD AKASVDAILK AL
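
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. It is a minimal example, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares for internal codons (table 11 differs in the start codons it permits, which does not matter here because this gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the whitespace used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    # First 30 bases and last 9 bases of the gene sequence.
    print(translate("ATGTCTGGAG GATTAGTTAC AGCTGCATAC"))  # MSGGLVTAAY
    print(translate("GCTCTGTAA"))                          # AL
```

The two fragments reproduce the first ten and last two residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the complete protein and the GC content field to be checked the same way.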