Gene Nham_0203 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNham_0203 
Symbol 
ID4030663 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrobacter hamburgensis X14 
KingdomBacteria 
Replicon accessionNC_007964 
Strand
Start bp226878 
End bp228275 
Gene Length1398 bp 
Protein Length465 aa 
Translation table11 
GC content68% 
IMG OID637968738 
Productglycosyl transferase family protein 
Protein accessionYP_575563 
Protein GI92115834 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.678813 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGTTTG TTGCGTGGGG GCGTGACTTG GCGGGGCGGG ACCAAGTTGA GGATGGCAGA 
TGGATCGCCG CACCGCGACG CGACCGGCCT GTGGAAGCGC CGCTCCCGGT CTGTCCTGCC
GGCGTTTCTC ACATTCTGCA ATCGAGCAAC GACAACTGGC TGTGCGACCC GGAGCCGGGC
CCGGCGAGCG AACTTGATTG CCTGCGACAC GTTCTCGCTC CGGCTCTGCT GCGCGCGGCC
GAGGCGCGCG GCCGGGAGGT GGGGATCGGC GCGGATCAGG CGCTGATCCG GTCGGGCGTC
ATCAATGAAG ACGCTTACCT GCAAACACTC TCCTCTCACA CCGGTCTTGC GATCGAGACA
TGCGCAGAGG AATCCCGCGC CGATTGCCCC TTGCCCGATC GTCACCTGCC TGGCGCCGCG
GAACACGGAC TGCTTCCGCT GCGCCGGGAT GGGAAGTTGA TCTGGGCGGT CGCACCGCGC
GGTTTCGCCG CGCGACGGCT CTGCCGCCTG ACTGCTGCAT ATCCATCGCT ATGCGACCGG
GTGCGTCTGA CATCGACGCG AGACCTCAAT CAGTTTCTGC TGCGGCAGAC CGGCGACGTG
CTCGGCCAGT CCGCCGCCAA TGCGCTCGGT CGGCGATTCC CGGCGCTGTC CGCCGCGCCG
GTTGCCGACG GTGCCCCCGG CGGGCTGCGG ACAATGCGGC GCCCCGCGCA GACCGGCGCT
CTCGCCGTCA TGATGGTGCT GACGCCAACC TTCGCCCTGG ACATCTTGAG CGACGCGCTG
GCGATATGGT TTCTCGCATT CATCAGCCTG CGGCTTGCGG CCAGCCTGAG ACCGCCGCGC
CCGGCGGTGC GGCTGCCGCG TGTCCCGGAC GGCCGTCTGC CGACCTACAC CGTGATCGCG
GCGCTGTATC GCGAAGCGGC ATCGGTCGCA CCGCTGATGC AGGCCATCGG CGCCCTGGAC
TACCCGCGCG AAAAGCTCGA CGTCATCATC GTGATCGAGC TCGACGATCT CGAAACCCGC
GCCGCTCTTG CGAGGCTCGG CCCGATGCCG CAGGTTCAGG TCCTGCTCGC CTCGGCGGAA
GGGCCGCGCA CCAAACCGAA GGCGCTGAAT TGCGCGCTGC CGTTCGCGCG CGGCAGCTTC
ACCGCCGTGT TCGATGCCGA GGACCGTCCC GATCCGGGCC AGCTCCGCGC CGCGCTCGAC
GCCTTTCGCA TTCAAGGCAC GGACGTGGCC TGCGCTCAGG CCAGTCTCTG CATCGAAAAC
CAGTCCGACA GCTGGCTCTC GCGCATGTTC GCCGCCGAAT ATGCCGGACA GTTCGACGTC
TTTCTTCCGG GACTCGCATC ATTCGGGGTG CCGCTGCCGC TCGGCGGATC GTCGAACCAC
TTCCGCGGTA TTAGGTAG
 
Protein sequence
MSFVAWGRDL AGRDQVEDGR WIAAPRRDRP VEAPLPVCPA GVSHILQSSN DNWLCDPEPG 
PASELDCLRH VLAPALLRAA EARGREVGIG ADQALIRSGV INEDAYLQTL SSHTGLAIET
CAEESRADCP LPDRHLPGAA EHGLLPLRRD GKLIWAVAPR GFAARRLCRL TAAYPSLCDR
VRLTSTRDLN QFLLRQTGDV LGQSAANALG RRFPALSAAP VADGAPGGLR TMRRPAQTGA
LAVMMVLTPT FALDILSDAL AIWFLAFISL RLAASLRPPR PAVRLPRVPD GRLPTYTVIA
ALYREAASVA PLMQAIGALD YPREKLDVII VIELDDLETR AALARLGPMP QVQVLLASAE
GPRTKPKALN CALPFARGSF TAVFDAEDRP DPGQLRAALD AFRIQGTDVA CAQASLCIEN
QSDSWLSRMF AAEYAGQFDV FLPGLASFGV PLPLGGSSNH FRGIR