Gene Saro_2339 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2339 
Symbol 
ID3915684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2487195 
End bp2488199 
Gene Length1005 bp 
Protein Length334 aa 
Translation table11 
GC content67% 
IMG OID640445095 
Producthypothetical protein 
Protein accessionYP_497610 
Protein GI87200353 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG5653] Protein involved in cellulose biosynthesis (CelD) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAACGTCT ATCACGCCAG TCTCGAGGAA GCGCAAGCCG ACCCGCGCCT GAAGGGGTTG 
CAAGGCGCCT CGCCGTTCGA GCGGCTGGAC TGGCTGGCGC TGCTGGCGAA CGAATGCCTC
GACCCGGCGC GGGCGCGCCT GAGCGTGGTC ACCAGCGGCG AGTGCATGGC CGCGCTCCCC
TGGATCGAGC GCGAGGGACG GATCGACGCG CTGGCCAACT GGTACAGCTT TTTTGTTTCT
CCTCTTGGCG ATAGCGCTCT TCTTTCACGG ATAGTCGAGG CGCTTCCGCA CGGGCGCGCC
GCGTTCGCGC CACTGCCCGA GGAAGATGCG CGCCTGCTTG CCCGCGCCTT TCGGAACGCG
GGGTGGTGTA CGCTCGCGGC GCCTTGCGAC GTCAACCATG TGCTTCCGGT CGAGGGGCGG
TCCTTCGCCC AATACTGGGC CGAGCGTCCG GGCGCGCTGC GCGAGACGGT GCGTCGCAAG
AGCCGCAAGG GCGAAGTATC CTTGCGCATC TTGACCGAAT TTTCTCCCGA AGATTGGGAG
GCTTACGAGA CGATCTACAG GCTGAGCTGG AAGCCGGGCG AAGGCAGCCC GGCATTCCTG
CGCAAATGGG CCGAAGCCGA TGGCGAGGCG GGGCGGCTGC GGCTCGGCAT TGCCGAAATC
GACGGAGCGG CAGTGGCCGC GCAGTTCTGG ACCGTCGAGG GTGGCACGGC CTACATCCAC
AAGCTCGCCC ACGACGAACG CTTCCGAAAA TCCTCGCCCG GGACGCTCCT GACGGCCGCG
ATGTTCGAAC ACGTGATCGA CCGCGACCGC GTGGACCTGA TCGATTTCGG GACAGGAGAC
GATCCCTACA AGCGCGACTG GATGGACGAC GTGCGCTCGC GCTGGAGCGT GCAGGCCTGG
CGTCCGGGCG CAGTGCGGCA CTGGCCCTCG CTGGCCCTGG CGCTGGCCCG GACACTGGCC
GGGCAGATCA TGCGGCCTCT TGTGTCGCGA AATGGCGATG GTTAA
 
Protein sequence
MNVYHASLEE AQADPRLKGL QGASPFERLD WLALLANECL DPARARLSVV TSGECMAALP 
WIEREGRIDA LANWYSFFVS PLGDSALLSR IVEALPHGRA AFAPLPEEDA RLLARAFRNA
GWCTLAAPCD VNHVLPVEGR SFAQYWAERP GALRETVRRK SRKGEVSLRI LTEFSPEDWE
AYETIYRLSW KPGEGSPAFL RKWAEADGEA GRLRLGIAEI DGAAVAAQFW TVEGGTAYIH
KLAHDERFRK SSPGTLLTAA MFEHVIDRDR VDLIDFGTGD DPYKRDWMDD VRSRWSVQAW
RPGAVRHWPS LALALARTLA GQIMRPLVSR NGDG