Gene Saro_2681 details

Gene Information

Locus tag: Saro_2681
Symbol:
ID: 3918455
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2920156
End bp: 2921307
Gene Length: 1152 bp
Protein Length: 383 aa
Translation table: 11
GC content: 70%
IMG OID: 640445458
Product: tRNA synthetase, class II (G, H, P and S)
Protein accession: YP_497951
Protein GI: 87200694
COG category: [E] Amino acid transport and metabolism
COG ID: [COG3705] ATP phosphoribosyltransferase involved in histidine biosynthesis
TIGRFAM ID:
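
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 2920156
end_bp = 2921307
gene_length_bp = 1152
protein_length_aa = 383

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# An unspliced CDS is read in whole codons; the last codon is assumed to be the stop.
codons, remainder = divmod(span_bp, 3)
expected_protein_aa = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", remainder == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 1152 bp is 384 codons, or 383 residues plus one stop codon.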


Plasmid Coverage information

Num covering plasmid clones: 36
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACACCA GCGACCCCGA TCTGCTGCCG GAAGGCCTCG AAGACCGTCT GCCCCGCGAT 
GCGGCCACCG CCACGCGCGT CATGCGCGCG ATCCACGGCG TGATGCACGG CCACGGCTAC
GACCGGGTCA TGCCCCCGTC CATCGAGTAC GAGCGCAGCT TCGCCGCGCG CATGGCCGGC
ATCCAGTCGC GCCGCATGTT CCGCTTCGTC GATCCGTCGA GCCTGCGCAT GATGGCCCTG
CGAAGCGACT TCACCCCGCA GATCGGCCGC CTTGCCGAAA CGCGCCTGGC CGAAGCGCCG
CGTCCGTTGC GCCTGTGCTA TGCGGGCCAG GTCGTCACGA TCAAGGCTGA CGGCCTCAAC
CCCTCGCGCG AAAAGCTCCA GTGCGGCGCC GAACTCGTCG GTGCCGACAA TGTCGCCGCC
GCCGCCGAAG TCGTCGCCAT CGCCATCGAG GCACTTCAGG CCGCGGGCGC CACGGGCGTC
AGCGTCGATT TCACGCTGCC CGACCTGGTC GATACGCTGG CCGAAAAGGC CCTCCCCCTG
GCCCCCGGCC AGATCGAGGC CGTCCGCCGC GAACTCGACA CCAAGGACGC AGGCGGCCTG
CGCGATGTCG GCGGCGAAGC CTACGTGCCG TTGCTCTACG CCACCGGCGA ATTCGACACG
GCGATCGACA AGCTTGCCGC GATCGATGCC GGCGGCGCGC TTGCCAGCCG CATCGACGCG
CTCCGGCAGA TCGCCGCTCG CCTCGGCGGC GCAGCGCGCC TGACGCTGGA CCCGAGCGAG
CGCCATGGCT TCGAATACCA GACCTGGTTC GGCTTCACCC TCTATGCCGA AGGGGTGCGC
GGCATCGTCG GGCGCGGCGG CACCTATCGC ATCGCGGGTT CCGATGCCGA TGCACGTCAG
GCCAATGCAC GACAGCAAGG CGAAGCCGCC ACCGGCTTCT CGCTCTATCC CAACGCCCTG
ATCGATCTCC TGGCCGCGAA CGAGCCCGCC GAAGATACCG TCTTCCTCCC GCTCGGCCAT
GACCGCGACG AAGCCGCCCG CCTGCGCGCC ATCGGCTGGC GAACGGTCGC GGCGCTCAGC
GAAGCGGACA GCGCGGACGC TCTCCGCTGC ACGCACATGC TCGGCGCGAA CGGACCGGAA
AAGCTGGCAT AA
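
A GC content figure such as the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, not the database's own method: the function name `gc_fraction` is hypothetical, and it assumes that the spaces and line breaks separating the 10-base groups should simply be ignored.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and non-ACGT characters."""
    bases = [base for base in sequence.upper() if base in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A, C, G or T bases")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# Example on the first row of the gene sequence; paste the full sequence to check the whole gene.
first_row = "ATGGACACCA GCGACCCCGA TCTGCTGCCG GAAGGCCTCG AAGACCGTCT GCCCCGCGAT"
print(f"GC fraction of first row: {gc_fraction(first_row):.2f}")
```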
 
Protein sequence
MDTSDPDLLP EGLEDRLPRD AATATRVMRA IHGVMHGHGY DRVMPPSIEY ERSFAARMAG 
IQSRRMFRFV DPSSLRMMAL RSDFTPQIGR LAETRLAEAP RPLRLCYAGQ VVTIKADGLN
PSREKLQCGA ELVGADNVAA AAEVVAIAIE ALQAAGATGV SVDFTLPDLV DTLAEKALPL
APGQIEAVRR ELDTKDAGGL RDVGGEAYVP LLYATGEFDT AIDKLAAIDA GGALASRIDA
LRQIAARLGG AARLTLDPSE RHGFEYQTWF GFTLYAEGVR GIVGRGGTYR IAGSDADARQ
ANARQQGEAA TGFSLYPNAL IDLLAANEPA EDTVFLPLGH DRDEAARLRA IGWRTVAALS
EADSADALRC THMLGANGPE KLA
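
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a simplified illustration with hypothetical names (`CODON_TABLE`, `translate_cds`). It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may start a gene; alternative start codons are not handled here, which is sufficient for this gene because it begins with ATG.

```python
from itertools import product

# Codon-to-amino-acid assignments in NCBI order (bases T, C, A, G; third base varies fastest).
# These assignments are shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(sequence: str) -> str:
    """Translate a CDS codon by codon, ignoring whitespace and dropping one trailing stop."""
    dna = "".join(sequence.upper().split())
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))
    # Remove the terminal stop codon symbol, if present.
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First 30 bases and last 12 bases of the gene sequence above.
print(translate_cds("ATGGACACCA GCGACCCCGA TCTGCTGCCG"))  # MDTSDPDLLP
print(translate_cds("AAGCTGGCAT AA"))                     # KLA
```

Both fragments match the start and end of the listed protein sequence.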