Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1596 |
Symbol | pntA |
ID | 6145281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1584587 |
End bp | 1586119 |
Gene Length | 1533 bp |
Protein Length | 510 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616473 |
Product | NAD(P) transhydrogenase subunit alpha |
Protein accession | YP_001743651 |
Protein GI | 170682207 |
COG category | [C] Energy production and conversion |
COG ID | [COG3288] NAD/NADP transhydrogenase alpha subunit |
TIGRFAM ID | [TIGR00561] NAD(P) transhydrogenase, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0118636 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGAATTG GCATACCAAG AGAACGGTTA ACCAATGAAA CCCGTGTTGC AGCAACGCCA AAAACAGTGG AACAGCTGCT GAAACTGGGT TTTACCGTCG CGGTAGAGAG CGGCGCGGGT CAACTGGCAA GTTTTGACGA TAAAGCGTTT GTGCAAGCGG GCGCTGAAAT TGTAGAAGGG AATAGCGTCT GGCAGTCAGA GATCATTCTG AAGGTCAATG CGCCGTTAGA TGATGAAATT GCGTTACTGA ATCCTGGGAC AACGCTGGTG AGTTTTATCT GGCCTGCGCA GAATCCGGAA TTAATGCAAA AACTTGCGGA ACGTAACGTG ACCGTGATGG CGATGGACTC TGTGCCGCGT ATCTCACGCG CACAATCGCT GGACGCACTC AGTTCGATGG CGAACATCGC TGGTTATCGC GCCATTGTTG AAGCGGCACA TGAATTTGGG CGCTTCTTTA CCGGACAAAT CACTGCGGCC GGGAAAGTGC CACCGGCAAA AGTGATGGTG ATTGGTGCCG GTGTCGCAGG TCTGGCCGCC ATTGGCGCAG CAAACAGTCT CGGCGCGATT GTGCGTGCAT TCGACACCCG CCCGGAAGTG AAAGAACAAG TTCAAAGTAT GGGCGCAGAA TTCCTCGAGC TGGATTTTAA AGAGGAAGCG GGCAGCGGCG ATGGCTATGC CAAAGTGATG TCGGACGCGT TCATCAAAGC GGAAATGGAA CTCTTTGCCG CCCAGGCAAA AGAGGTCGAT ATCATTGTCA CCACCGCGCT TATTCCAGGC AAACCAGCGC CGAAGCTAAT TACCCGTGAA ATGGTTGACT CCATGAAGGC GGGCAGTGTG ATTGTCGACC TGGCAGCCCA AAACGGCGGC AACTGTGAAT ACACCGTGCC GGGTGAAATC TTCACTACGG AAAATGGTGT CAAAGTGATT GGTTATACCG ATCTTCCGGG CCGTCTGCCG ACGCAATCCT CACAGCTTTA CGGTACTAAC CTCGTTAATC TGCTGAAACT GTTGTGCAAA GAGAAAGACG GCAACATCAC TGTGGATTTT GATGATGTGG TGATTCGTGG CGTGACCGTG ATCCGTGCGG GCGAAATTAC CTGGCCGGCA CCGCCGATTC AGGTATCAGC TCAGCCGCAG GCGGCACAAA AAGCGGCACC GGAAGTGAAA ACTGAGGAAA AATGTGCCTG CTCACCGTGG CGTAAATACG CGTTGATGGC GTTGGCAATC ATCCTTTTCG GCTGGATGGC AAGCGTTGCG CCAAAAGAGT TCCTTGGACA CTTCACTGTG TTCGCGCTGG CCTGCGTTGT CGGTTATTAC GTGGTGTGGA ATGTATCGCA CGCGCTGCAT ACACCGTTGA TGTCGGTCAC CAACGCGATT TCAGGGATTA TTGTTGTCGG AGCACTGTTG CAGATTGGCC AGGGCGGCTG GGTTAGCTTC CTTAGTTTTA TCGCGGTGCT TATAGCCAGC ATTAATATTT TCGGTGGCTT CACCGTGACT CAGCGCATGC TGAAAATGTT CCGCAAAAAT TAA
|
Protein sequence | MRIGIPRERL TNETRVAATP KTVEQLLKLG FTVAVESGAG QLASFDDKAF VQAGAEIVEG NSVWQSEIIL KVNAPLDDEI ALLNPGTTLV SFIWPAQNPE LMQKLAERNV TVMAMDSVPR ISRAQSLDAL SSMANIAGYR AIVEAAHEFG RFFTGQITAA GKVPPAKVMV IGAGVAGLAA IGAANSLGAI VRAFDTRPEV KEQVQSMGAE FLELDFKEEA GSGDGYAKVM SDAFIKAEME LFAAQAKEVD IIVTTALIPG KPAPKLITRE MVDSMKAGSV IVDLAAQNGG NCEYTVPGEI FTTENGVKVI GYTDLPGRLP TQSSQLYGTN LVNLLKLLCK EKDGNITVDF DDVVIRGVTV IRAGEITWPA PPIQVSAQPQ AAQKAAPEVK TEEKCACSPW RKYALMALAI ILFGWMASVA PKEFLGHFTV FALACVVGYY VVWNVSHALH TPLMSVTNAI SGIIVVGALL QIGQGGWVSF LSFIAVLIAS INIFGGFTVT QRMLKMFRKN
|
| |