Gene Nham_1206 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_1206 
Symbol 
ID4032297 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp1361321 
End bp1362661 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content58% 
IMG OID637969686 
Producttwin-arginine translocation pathway signal 
Protein accessionYP_576495 
Protein GI92116766 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1653] ABC-type sugar transport system, periplasmic component 
TIGRFAM ID[TIGR01409] Tat (twin-arginine translocation) pathway signal sequence 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATCCC ATGCATCTCG TCGCTATGGC CTATCGCGCC GCGACGTCAT CAAGACCGCT 
GTGGGTGCGG CCGCCACCGT CGGCCCATTC TTCCATGTTG CGCCGGCTCG TGCGGCCAAG
ACGCTGAAGA TCCTGCAGTG GAGCCACTTC GTCCCCGGCT ACGACAAGTG GTTCAACAAC
ACCTACATCA AGGAATGGGG CGCCCAGCAT GGCACCGAAG TGGTCGTCGA CAACATCAAC
CTCGGCCTGA TCCCTTCGCG TGCGGCAGCG GAAGTATCGG CGCAGAAGGG GCATGACCTC
GTGATGTTCC TCGCCCCGCC TTCGGTTTAC GAAGAGCAGG TCGTCGACAT GAAGGATGTC
TACGACGCGT GCGAGAAAAA GTACGGCAAG CCGATCGATC TCGCCGTAAA GAGTACCTAC
AACCCCAAGA CCCGGAAGTA CTTCGCGTTC TCCGACAGCT TCGTCCCCGA TCCTGTCAAC
TACCGCTCGG ATCTTTGGGG CGACGTCGGC ATGAAGCCCG ATAGCTGGGA CAATGTCCGC
ATCGGCGGCA AGAAGATCAA GGACAAGACC GGAATCCCGG TCGGCATCGG CCTCTCCGCC
GAGCTCGACA CCGCGATGGC AATGCGCGCG ATCATGTATT CATTCGGCGC GCACGAGCAG
GACGTCGATG GCAATCTCGC GATCAATTCC AAGGAAACCC TCGAAGCCCT CAAATTCGTC
AAAGCGCTGT TCGAGGAAAC GGAAACGCCC GAAGTTTTCG CGTGGGATCC GTCGTCGAAC
AATCGGCAGA TGCTCGCCGG CAGGTCATCT CTGGTGCTGA ACGCGATTTC GGTCACGCGC
ACGGGCGAAA ACGACAAGAT GCCGATCCAC GAGAAGATTG CGCTCGCCAA GCCGCCGAAA
GGCTCGGTTC GGCAGATCGG CCTTGAGCAC GTGATGGATT GCTACGTGAT CTGGAAATTC
TCCGAAAACA TTGACGGCGC AAAAATGTTC CTGGTCGACT ACATCGACAA CTTCAAGCAG
GGCTTCATGG CCAGCGAGTA TTACAACTTC CCCTGCTTCT CGAAGACCGT TCCCGACCTG
GCACAGATCA TCTCCAGGGA TTCCAAGGCC GTGCCGCCGG ACAAGTACGC GGTGCTTTCG
GACGTGCTCG ATTGGGCAAC TAACGTCGGC TATCCCGGCT ACTCCAACGC CGCGATCGAC
GAAACTTTCA ACACCTGGGC GATCAATACC ATGTTCGCAG AAGCTGCCGC GGGCGCCGAA
ACTCCGGAGA ACGCTCTCAA GCGGGCGGAA GCCAAGATGA AGGCGATCTG GGCCAAATGG
AAAGATCGAA AGATGATTTG A
 
Protein sequence
MASHASRRYG LSRRDVIKTA VGAAATVGPF FHVAPARAAK TLKILQWSHF VPGYDKWFNN 
TYIKEWGAQH GTEVVVDNIN LGLIPSRAAA EVSAQKGHDL VMFLAPPSVY EEQVVDMKDV
YDACEKKYGK PIDLAVKSTY NPKTRKYFAF SDSFVPDPVN YRSDLWGDVG MKPDSWDNVR
IGGKKIKDKT GIPVGIGLSA ELDTAMAMRA IMYSFGAHEQ DVDGNLAINS KETLEALKFV
KALFEETETP EVFAWDPSSN NRQMLAGRSS LVLNAISVTR TGENDKMPIH EKIALAKPPK
GSVRQIGLEH VMDCYVIWKF SENIDGAKMF LVDYIDNFKQ GFMASEYYNF PCFSKTVPDL
AQIISRDSKA VPPDKYAVLS DVLDWATNVG YPGYSNAAID ETFNTWAINT MFAEAAAGAE
TPENALKRAE AKMKAIWAKW KDRKMI