Gene Nham_1072 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_1072 
Symbol 
ID4031639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp1204455 
End bp1207718 
Gene Length3264 bp 
Protein Length1087 aa 
Translation table11 
GC content65% 
IMG OID637969570 
Producttetratricopeptide TPR_2 
Protein accessionYP_576380 
Protein GI92116651 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCAAT CGCGTCCGGT ATCCGTTAGT AGCACGATCG AGGACATGCA GGCGGAGCGC 
AACGCGCTCG AAGCCTTTGC CAATGATCGC GCCCGCGACT TCGTTGGCAG GCAGTCGATC
ATTGCGCGCG TCACCGACCT CTGTCTCTCC CCCGCGAAAG AAAATCTCTC GCCCACGAAA
GAGGCCACGT CGTGGGGCAT CTGCATTACC GGCGATCCGG GTTCGGGCAA GAGCGCGCTG
TTCGGCGAAC TTCTCCACCG CATGAAGGGG ACCGACGCTT TCGTTCTCGC TCACGCCGCC
GGCGCGAATC CGCGCGCCTC ATCGGTCGAT GCCATGCTGC GTCGCTGGAT CGTCGAACTC
GGCAGCGCGC TCGGCGTCGG CAACGTTGAC CTCGCCGCGA ATATCGACCC GGAGATTGTC
GAGAGGGCTT TCGTCTCGCT GCTCACGCGG ATGGCGCCGC AGCGGCGCGT GGTTATGGTG
ATCGATGCGA TCGACCAGTT CAAGAAGACC CCGCGCGGAA AGTTTACGAC CTGGCTGCCG
CGGATGTGGC CGACCAATAC GCGGCTCGTC GCAACCGCCA TTGACGGCGG CGCCTCCAAG
GCGCTGGCCG AGCGTTCGGG CGTGGAAGCG TTGTCTCTGC CGCCGCTGGA CGCGACCGAG
GCACGTGGCA TCATCGATGC CGTATGCAAA CGCAATAACC GCCAGTTGGA GCCTTCGGTG
ATCGATGCGC TGCTGGCGAA GAAGCATGCG GGCGCGCCGG CGTCGGGCAA TCCGTTGTGG
CTGGTGCTCG CCCTCGAGGA ACTGGAGCTG CTCGACAGCG ATGATTTTGC CGACATGCAG
CGCGAATATG TCGGCCCTCC GGAGGAACGC GTCGCGAGTT TGATGCTCGA CACTGTCGGC
GCCATGTCGA CGGACATTTT ACGCCTCTAT CACGCAACGT TCGACAGCGC GGCCGAACTG
TTGGGCCCCG CCGTTACGTC GGCCTTTATC GGTCTCATTT CCGCGAGCCG GACGGGATGG
CGAGAGAGCG ATTTTCGTCA GTTGCTGCCG CAGGTCAGCG GCGAACCCTG GGACGAACAG
CGTTTCGCCG CGCTGCGCCG CCTGTTCCGC GGTCAGATTC ACCAGCATGG CGATCTTGCG
CAGTGGAATT TCAATCACGC CCAGGCGCAG GCCGCCGCCC GCTCACGCCT CGCGGCGCTG
CGCATCTCGA ATCCCCAACT GCACCTGTTG ATCGCCGATC ATCTTTTGAC CCTGGCGCCG
GACGACCTGC TGCGCGCGAC TGAAACCATG GTCCACCTTC TGGCGAGCAA GGACGACACC
CGTGCAGGGC AGTACTATGG CGATTCATCG TTGAGCGAGG CGGGGCTGCA AGGCGCCACG
CGTGCGCTCG CCGACGCCAT GATTTCGCCG GCGACAGGCA CTCCGGCGAG CGCGGCGCAA
GAGATTTGCC GTTTGCTTGA CAACCCCGAC AACGCTGTCC GCGCACTCGC GGCCGAGCGT
TTCCTCTTCA ATCTCGACGA CGCGGCCGAA CGGCACGTGT CGCCCGATGC CCGCATGACC
GTCTTGAATG CGATCGAAAG CGCGTTCGAG CAGCTGCTTC GCACCGATCC CGACAACGCC
GGCTGGCAGC GCAATCTATC GGTCGCACGC GATCGTGTCG GCGACGTGCT GGTGGCGCAG
GGCAAGCTGC CCGACGCGCT GAAATCCTTC CGCGACGGGC TCGCGATCAG GGAGCGGCTG
GCGAGCGCCG ATCCCGGAAA CGCCGGGCGG CAGCGCGATC TGTCGCTGTC GCACGAGAAG
ATCGGAGACG TGCTGGCGGT GCAGGGCAAG CTGCCCGAGG CGCTGGAAGC CTTCCGCAGC
CAGCTCGCGG CCGCCGAGCG GCTGGCGAAC GCCGACCCTG ACAATACAGA GCTGCGGGTC
GGTCTGTCGC TGTCGCACGA GAAGATCGGC GAAGCGCTGA TGGCGCAGGA CAAGCTGCCC
GAGGCGCTGG AAGCCTTCCG CAACCAGCTC GATATCATCG AGCAGCTGGC GCGCGCCGCC
GACACCGATG ACAACGAGTG GCAGCGCGAC CGGACGCTGT CCTACGATCG CATCGGCGAC
GTGCTGATGG CGCAGGGCAA GCTGCCCGAG GCGCTGGAGG CCTTGCGCGA CGGGCTCAAG
ATCAAGGAGC GGCTGGCGAA GGCTCATCCC GACAACACCG GCTGGCAGCG CAGTCTGTCG
CTCTCATATG ATCGTATCGG CGACGTGTTG GTGACGCAGG GCAAGCTGCC CGACGCGTTG
ACAGCCTTCC GTACAGGACT CGCGATCGCC GAGCAGTTGG CGCGCGCCGA GCCCGACCAC
GTCGGTTGGC AGCGCGATCT GTCGGTGTCC CACTCCAAGG TCGGCGACGT GCATGTGGCG
CAGAGCAGTC TGCCCGAAGC GTTGAAATCC TTCCATGAAG CGCTGACGCT CAGACAGCGG
CTGGCACACG CCGACCCCGA TAATGTTGAC TGGCAACGCG GCCTGGCGGT GTCCAACGAT
CGTATCGGCG ACGTGCTGAT GGCTCAGGAG AATCTGGGCG AGGCGCTGAA GGCCTTCCAG
GACCAGCTCG CAATTGCCGA GCGACTGGCG CAGGCCAACC CGGGCAACGC CGGCTGGCAG
CGCGATCTGT CGGTGTCTCA CGAGAAGGTC GGCGACGTGC TGGTGGCACA AGGCAATCAG
GCCGAGGCGC TGAAAGCCTT CCGCAACGCC CTTGCGATCA GAGAGCGGCT GGCGCGCAAC
GACCCCGGCA ACTCCGGCTG GCAGCGCAGT CTGTCGGTGT CACACGATTG TATCGGCGAC
GTGCTGGAAT CGCAGGGCAA TCAGGCCGAA GCGCTGAAAG CCTTCCGCAA CGCCCTTGCG
ATCAGGGAGC GACTGGCGCA AAGCGACCCC GCCAACGTCG CCTGGCAGCG CGATCTTACG
GTATCCTACG ATCGTATCGG CGAAGTGCTG GAAGGGCTGG GCGATCAGGC CGAGGCGGTG
AAAGCCTTTA ACGAGGGGAT CGCGATCAGC GAGCGGCTGT CGCTCGCCGA CCCCGGCAAC
GTCGACTTGC AGCGCAGCGT TGCCGTGAGC CAGGGGCATC TGGCCGAGAT GTATCGTCGA
TCCAATGACC ACAACAATGC GCTGGCTGCG CTGCGGCAGG GACAAGCCGC CATGGAGCGC
GTCGTCAGGC GCGCGCCAGA CAATGCCGGC TGGAAGAAAG ATCTGGACTG GTTCAACGAG
CAGATTGAGA CCTTGGCGGA TTAA
 
Protein sequence
MSQSRPVSVS STIEDMQAER NALEAFANDR ARDFVGRQSI IARVTDLCLS PAKENLSPTK 
EATSWGICIT GDPGSGKSAL FGELLHRMKG TDAFVLAHAA GANPRASSVD AMLRRWIVEL
GSALGVGNVD LAANIDPEIV ERAFVSLLTR MAPQRRVVMV IDAIDQFKKT PRGKFTTWLP
RMWPTNTRLV ATAIDGGASK ALAERSGVEA LSLPPLDATE ARGIIDAVCK RNNRQLEPSV
IDALLAKKHA GAPASGNPLW LVLALEELEL LDSDDFADMQ REYVGPPEER VASLMLDTVG
AMSTDILRLY HATFDSAAEL LGPAVTSAFI GLISASRTGW RESDFRQLLP QVSGEPWDEQ
RFAALRRLFR GQIHQHGDLA QWNFNHAQAQ AAARSRLAAL RISNPQLHLL IADHLLTLAP
DDLLRATETM VHLLASKDDT RAGQYYGDSS LSEAGLQGAT RALADAMISP ATGTPASAAQ
EICRLLDNPD NAVRALAAER FLFNLDDAAE RHVSPDARMT VLNAIESAFE QLLRTDPDNA
GWQRNLSVAR DRVGDVLVAQ GKLPDALKSF RDGLAIRERL ASADPGNAGR QRDLSLSHEK
IGDVLAVQGK LPEALEAFRS QLAAAERLAN ADPDNTELRV GLSLSHEKIG EALMAQDKLP
EALEAFRNQL DIIEQLARAA DTDDNEWQRD RTLSYDRIGD VLMAQGKLPE ALEALRDGLK
IKERLAKAHP DNTGWQRSLS LSYDRIGDVL VTQGKLPDAL TAFRTGLAIA EQLARAEPDH
VGWQRDLSVS HSKVGDVHVA QSSLPEALKS FHEALTLRQR LAHADPDNVD WQRGLAVSND
RIGDVLMAQE NLGEALKAFQ DQLAIAERLA QANPGNAGWQ RDLSVSHEKV GDVLVAQGNQ
AEALKAFRNA LAIRERLARN DPGNSGWQRS LSVSHDCIGD VLESQGNQAE ALKAFRNALA
IRERLAQSDP ANVAWQRDLT VSYDRIGEVL EGLGDQAEAV KAFNEGIAIS ERLSLADPGN
VDLQRSVAVS QGHLAEMYRR SNDHNNALAA LRQGQAAMER VVRRAPDNAG WKKDLDWFNE
QIETLAD