Gene Saro_2945 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2945 
Symbol 
ID3917380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3162340 
End bp3163869 
Gene Length1530 bp 
Protein Length509 aa 
Translation table11 
GC content64% 
IMG OID640445723 
Productextracellular solute-binding protein 
Protein accessionYP_498214 
Protein GI87200957 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATCTTA GCCGCCGTAC ACTCATCGGT TCCGGCATCG CCGCTGCTGG CACGCTGGTG 
CTTCCCGCTT TTGCCGGGCC CGTGCGCGCC GGCCCCCGAC AGGGTGGTTC GATCCGCGTG
GCGATGCAGT CCTCGTCCAC TGCCGATACG CTCGACCCCG CCAAGGGCGC GATGGCCACG
GACTACGTGC GCCACTTCTG TCTCTATAGC GGTCTCACGC GGATGGAGGC GGACCTGCGC
CCGCGCCCTT TCCTTGCCGA GGCACTAGAG ACGGCTGACC GGATAACCTG GCACATAGCC
CTCAAGCGCG GCGTCCGCTT TGCCGATGGT CGTGAACTGA CCAGTGCCGA TGTCGTGCAT
TCGTTGCAGC GCCATCTCGA TCCCAAGCTG GGCTCGAAGG TCGCTTCGAT CGCCAAGCAG
TTCGCCGAAG TGCGCGCCGA TGGCCGGCAT GGCGTCGTCA TCAAGCTGTC CGGGGCCAAC
GCCGACCTTC CCGCGATCCT CTCGCAGTCG CATTTCCTGA TCGTGCCTGC TGGAGAGGAC
AAGCCCAGTG GCAACGGTTG TGGTCCGTTT CTGCTTGCCG ACTTCCGACC CGGCGTGCGC
ACCGTGGTCA CGCGCAATCC CGAATATTGG CGCAACGGCG AGCCCTATCT CGACCGCATC
GAGATCATTG CGATTCCCGA CGAGATAAGC CGGGTGAATG CGCTGCTTGC TGGCGACGTG
CAACTTGTCA TCGCGGTTCA GCCGCTATCG ACGCGGCGGA TCGAGCAGTC GTCGCAGCAC
GGCATTCTGA CCACGCCTTC TCCGCTCTAT ACTGATCTGG TCATGCGGCA GACCCAGTTG
CCAACCGGCA ATCCGGACTT CGTGGCGGCG GTCAAGCATC TGATTGACCG GCCGCTGATC
AAGCGCGCGC TGTTTCGGGG TTATGCGACC ATCGGAAACG ACCAGCCGAT TCCTCCCTTC
CACCCCTATT TCAATCCGGC GGTGCCGCAG ACGGTGCTCG ATCTGGACCG GGCGAAGTGG
CACATCGCGC GTTCGGGGCT GAAGGGCGTG CGGCTGCCGA TCTACTGTTC GCCTGCCGCC
GCTGGCTCGG TCGACATGGC GTCGGTGTTG CAGGAATATG GCGCGCAGGC AGGTCTCGAA
TTTGCGGTGA ACCGGATGCC GGCCGACGGC TACTGGTCGA CGCACTGGAT GAAGCACCCC
ATGAGCTTCG GGAACACCAA CCCGCGCCCC ACGGCGGACC TCGTGTTCAG TCAGTTCTTC
CGTTCGGACG CGGAGTGGAA CGAGAGCGGG TGGAAGAACC CGCGCTTCGA CGAACTGCTG
ACGCTGGCCC GTGCCGAGGC CGATGAAGCA CGTCGCAAGG AGCTTTATGG CGAGATGCAG
CAACTGGTGC ATGACCATTG CGGCGTGGCG ATTCCGGTAT TCATCAACAT GCTCGACGGC
CACGACCGGC GACTGAAAGG GATGTCTGCA ATCCCCCTCG GCGGCCTGAT GGGCTATCGC
TTTGCCGAAT ACGCCTGGTG GGACGCGTGA
 
Protein sequence
MDLSRRTLIG SGIAAAGTLV LPAFAGPVRA GPRQGGSIRV AMQSSSTADT LDPAKGAMAT 
DYVRHFCLYS GLTRMEADLR PRPFLAEALE TADRITWHIA LKRGVRFADG RELTSADVVH
SLQRHLDPKL GSKVASIAKQ FAEVRADGRH GVVIKLSGAN ADLPAILSQS HFLIVPAGED
KPSGNGCGPF LLADFRPGVR TVVTRNPEYW RNGEPYLDRI EIIAIPDEIS RVNALLAGDV
QLVIAVQPLS TRRIEQSSQH GILTTPSPLY TDLVMRQTQL PTGNPDFVAA VKHLIDRPLI
KRALFRGYAT IGNDQPIPPF HPYFNPAVPQ TVLDLDRAKW HIARSGLKGV RLPIYCSPAA
AGSVDMASVL QEYGAQAGLE FAVNRMPADG YWSTHWMKHP MSFGNTNPRP TADLVFSQFF
RSDAEWNESG WKNPRFDELL TLARAEADEA RRKELYGEMQ QLVHDHCGVA IPVFINMLDG
HDRRLKGMSA IPLGGLMGYR FAEYAWWDA