Gene Saro_2412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2412 
Symbol 
ID3916731 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2583756 
End bp2585444 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content70% 
IMG OID640445167 
ProductPTS system, N-acetylglucosamine-specific IIBC component 
Protein accessionYP_497682 
Protein GI87200425 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1263] Phosphotransferase system IIC components, glucose/maltose/N-acetylglucosamine-specific
[COG1264] Phosphotransferase system IIB components 
TIGRFAM ID[TIGR00826] PTS system, glucose-like IIB component
[TIGR01998] PTS system, N-acetylglucosamine-specific IIBC component 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGAACC TTCTCGGCCT GCTCCAGCCG ATCGGTCGCG CGCTGATGCT GCCGATCGCG 
GTGCTCCCCG TCGCGGGGCT TCTGCTGCGC CTCGGCCAGC CCGACCTGCT CGATCTTCCG
CTGCTGGCCG CGTCCGGCGA TGCGCTCTTC TCCAGCCTCG GACTGCTTTT CGCCATCGGC
GTTGCCGCGG GCATTGCGCG CGACGGCAAT GGTGCGGCGT GCCTCGCGGG GGTGGTCTGT
TATCTCGTCA CGATGAACGG CGGCAAGGCG CTCCTGCCGG TCCCGACGGA CGTCACGCTC
GGCCTTGCCG ATACGCTTGC CCAGACCGTC GCAAGGGCGT GGAAGGCCAA GGCCTTTGCC
CGGCTTGACG TACCCATCGG GATCGTCTCG GGTCTGCTCG GCGGCGCGCT CTACAACCGC
TTCTCGACCA TCGCCGTTCC TGCCTATCTC GCCTTCTTCG GCGGAAGGCG TTTCGTGCCC
ATCGTCGCGG GTGGCGCGGG GGTCGTGCTT GCCGGGGTCG TGGGCCTTGG CTTCCCGGCA
CTCGATGCCG CGCTCGATCA CGGTAGCCGG GGTCTCGTTG CCGCAGGTCC GGTCGGGCTC
TTCGCCTTCG GGGTTCTGAA CCGGCTGCTC CTCGTCACCG GGCTCCACCA CATCCTCAAC
AACGTCGCCT GGTTCGTCCT GGGCGATTTC GGGGGGACGA CCGGCGACCT GCGCCGCTTC
TTTGCCGGAG ACCCGCATGC GGGCGCCTTC ATGGCCGGGT TCTTCCCGAT CATGATGTTC
GGCCTGCCCG CCGCCTGCCT CGCCATGTAC CGGGCCGCGT TGCCCGATCA GCGCAAGGCG
ACCGGCGGCA TGCTTCTCAG CCTCGCGCTC ACCTCGTTCC TGACCGGCGT GACCGAGCCG
ATCGAGTTCA GCTTCATGTT CCTCGCACCG ATGCTCTACG CGGTCCACGC GGTGCTGACG
GGCGCGGCGA TGGTTCTGAT GGACGTGCTC GGCGTGCGCA TGGGCTTCGG CTTTTCCGCC
GGCCTTTTCG ACTATGTGCT GAACTTCGGC CGAGCCACCC GGCCGCTTCT CCTGCTGCCC
GTCGGCCTTG CCTATTTCGC GATCTACTAC GCGGTCTTCA GCTATGCGAT CCGCCGTTTC
GACCTCGCGA CGCCAGGCCG CCAGCCGCTC GCGCCCTCCG GCCAGCAGGA AACCGGCACC
GACGGCGAGT TGGGCCACGC CTATGTCGAA GCCCTTGGCG GCGCCGCCAA CATCGCCACC
TTGGGCGCCT GCACCACGCG CCTGCGCCTG GTCGTCCGCG ATCCCGCGGC GGTCGACGAT
GCCGCGCTCA AGGCGCTCGG CGCGGTCGCG GTGCTGCGCC CTGCCGCCGA TGCCGTGCAG
GTCATAATCG GCCCCATGGC AGACCGCATC TGCGCCGAGA TGGCCGACGT CGTGAAGCAC
GCACCCGCCG CCCCCGCCGC CACGCCTGTC CGCTCCGAAG CGTTAGCCGT CAGCCTGCGC
CCGGAAATCC TTGCCGCGCT CGGCGGAGCC GATGCCGTCG TCCAGGCCTC GCGCGCGGCC
GGACGCATCC GCGTGACGTT TCGCCGTGAC GCGGATCTTG CGGCAATGGC CCCGGTCCCC
GGCCTGCGCT GCGTCGCCCG CATCGACGCG CGGACCTGGC ACCTCATCGG CCAGGACCTC
GCGGTCTGA
 
Protein sequence
MRNLLGLLQP IGRALMLPIA VLPVAGLLLR LGQPDLLDLP LLAASGDALF SSLGLLFAIG 
VAAGIARDGN GAACLAGVVC YLVTMNGGKA LLPVPTDVTL GLADTLAQTV ARAWKAKAFA
RLDVPIGIVS GLLGGALYNR FSTIAVPAYL AFFGGRRFVP IVAGGAGVVL AGVVGLGFPA
LDAALDHGSR GLVAAGPVGL FAFGVLNRLL LVTGLHHILN NVAWFVLGDF GGTTGDLRRF
FAGDPHAGAF MAGFFPIMMF GLPAACLAMY RAALPDQRKA TGGMLLSLAL TSFLTGVTEP
IEFSFMFLAP MLYAVHAVLT GAAMVLMDVL GVRMGFGFSA GLFDYVLNFG RATRPLLLLP
VGLAYFAIYY AVFSYAIRRF DLATPGRQPL APSGQQETGT DGELGHAYVE ALGGAANIAT
LGACTTRLRL VVRDPAAVDD AALKALGAVA VLRPAADAVQ VIIGPMADRI CAEMADVVKH
APAAPAATPV RSEALAVSLR PEILAALGGA DAVVQASRAA GRIRVTFRRD ADLAAMAPVP
GLRCVARIDA RTWHLIGQDL AV