Gene Nham_2350 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_2350 
Symbol 
ID4032020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp2617682 
End bp2618932 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content63% 
IMG OID637970811 
Productaminodeoxychorismate lyase 
Protein accessionYP_577602 
Protein GI92117873 
COG category[R] General function prediction only 
COG ID[COG1559] Predicted periplasmic solute-binding protein 
TIGRFAM ID[TIGR00247] conserved hypothetical protein, YceG family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGAGA GGCCGCCCAT TTCTCCGCGC AGCCCGCGCG CTGCACTTGA ACCCGAACAG 
GTGCCGCCGC CGCCGAAGCG CTCCGCGCGC GTCCGCAGCC CGCTGGTGAT CGCCGGCAAC
GCGATCATCA CCATCGTGCT GATCGGGATG CTTGGCGTTG GCGGAATCTA TGTTTACGGC
AAGCAGAAGA TCGAGGCGCC TGGACCGCTG CGGGAAGACA AGATCGTCAA CATTCCGGCC
CGGGCCGGAA TGGGCGATAT CGCAGACATT CTCCAGCGCG AAGGCGTGAT CGACAGCAAT
CGCCTGGCGT TTATCGGAAG CGTATTGGCG CTAAAGGCAC GCGCCGGCCT CAAGCCGGGC
GAATATGAAT TCCAGAAGAA TGCCAGTCTG CGCGATGTCA TCGGCACCAT TGTCGAGGGC
AAGGTGGTGC AGCATTCCGT GACTATACCG GAGGGCTTGA CCTCCGAGCA GATCGTAGCA
CGTCTTTCGG AGAACGAGAT TTTCTCGGGG ACCATTCGCG AGATTCCACG CGAGGGCTCG
TTGCTGCCCG AGACATACAA ATTCCCGCGC GGAACCAGTC GGCAGCAGGT GATCCAGCGC
ATGCAACAGG CGCAGAAGCG CCTGCTTGCG GAAATCTGGG AGCGTCGCAC CCCCGATGTG
CCCGTCAAGA CGCCCGAGCA GCTCGTTACA CTCGCATCGA TCATTGAAAA GGAAACGGGC
AAGGCGGATG AACGCAGCCG GGTCGCGGCG GTGTTCGTCA ACCGCCTGAA GCAGAAAATG
AAGCTGCAAT CCGATCCGAC CATCATCTAC GGGCTGGTCG GCGGCAAGGG AACGCTGGGG
CGGCCGATCA AACGCAGCGA AATCACGCAA CCGTCCCCGT ACAACACCTA CGTGATTGAA
GGGTTGCCGC CCGGACCGAT CGCGAACCCC GGCCGGGCCT CGCTCGAAGC CGCGGCAAAT
CCGGCGCGGA CGCGCGATTT GTTCTTCGTC GCCGACGGGA CCGGGGGACA CAGTTTCACC
GAGACCTACG AGCAGCACCA GAAGAACGTC GCCCGGCTTC GGACCATGGA AAAGCAGATC
CAGAACGATA CGGTCGAACC GGAGGACGAT CCGCCGCCGC CGGTTGCGGC GCCCGCCGCA
ACGGATACCG ATGCGACAGG AGAAACACCT GCCGCGAAGC CTGCGCCGCG GAAGCGACCC
CGGCCGGCCC GGCAGGGCGC GGCAGAGCCT TCGCGGCACG TTGTCCAGTA G
 
Protein sequence
MTERPPISPR SPRAALEPEQ VPPPPKRSAR VRSPLVIAGN AIITIVLIGM LGVGGIYVYG 
KQKIEAPGPL REDKIVNIPA RAGMGDIADI LQREGVIDSN RLAFIGSVLA LKARAGLKPG
EYEFQKNASL RDVIGTIVEG KVVQHSVTIP EGLTSEQIVA RLSENEIFSG TIREIPREGS
LLPETYKFPR GTSRQQVIQR MQQAQKRLLA EIWERRTPDV PVKTPEQLVT LASIIEKETG
KADERSRVAA VFVNRLKQKM KLQSDPTIIY GLVGGKGTLG RPIKRSEITQ PSPYNTYVIE
GLPPGPIANP GRASLEAAAN PARTRDLFFV ADGTGGHSFT ETYEQHQKNV ARLRTMEKQI
QNDTVEPEDD PPPPVAAPAA TDTDATGETP AAKPAPRKRP RPARQGAAEP SRHVVQ