Gene Saro_3958 details

Gene Information

Locus tag: Saro_3958
Symbol:
ID: 5077442
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009426
Strand:
Start bp: 130001
End bp: 131227
Gene Length: 1227 bp
Protein Length: 408 aa
Translation table: 11
GC content: 68%
IMG OID: 640481064
Product: hypothetical protein
Protein accession: YP_001165726
Protein GI: 146275565
COG category:
COG ID:
TIGRFAM ID:
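
The coordinate and length fields above are related by simple arithmetic, and the following minimal sketch shows one way to cross-check them. It assumes that Start bp and End bp are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the values in this record but are not stated by the source. The function names are hypothetical.

```python
def gene_length_bp(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end - start + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(130001, 131227)
print(length)                     # 1227
print(protein_length_aa(length))  # 408
```

Both results match the Gene Length and Protein Length fields of the record.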


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCCAGC GATACCGTGT GCTCGGCCGC AGCGTGGCCG AAGGCGATGA ACTTCAGCCG 
CTTTTGGCAA AAGCCTACGC GCAAAAGGCC CAGGTCCTGT GCGAATGCCG CAAGGGGACC
GACCTGCCGC TCTACATTTC GCACAGGAAC GACCGCTACG TGCTGGCGCG CTGGCCGGGC
TCTGGCGCGC GGCACGCGAC CGCCTGCGAC CACTACGAGG CGCCCGATTA CCTGACCGGC
ATGGGCCAGG TCCGCGGCGC GGCGATCATC GACGATGAGA CGAGCGGCGA AACCAGCCTC
AAGCTCGGCT TCCCGCTCTC ACGTGGCAGC GCCCGGCTCG CGCCCGCGGC GATAACCAAC
GACAAGCCGA CGGTGAAATC GTCGGGCCAG AAGCTGTCGA TGCGCGGGCT GCTCCACGTC
CTATGGGACC GCGCCGAGCT CACGCACTGG CATCCCAAGA TGGCAGGCAA GCGCAGCTGG
TTCGTGGTGC GCCGCGCGCT GCTCGAGGCA GCCGCGTCAT GCCGGGCGAA GCAGGAGGCC
CTTCCCCATG TGCTGTTCGT GCCGGAAAGC TTCAAGCTGG AGGAGAAGGA GGACATCCGC
GCCCGCCGCC GCACCGCGCT CGAGCGGGTC TACCGCTCGC GCGACGAGAT GATGGTGGTG
GTCGGCGAGA TCAAGGAGAT CGTCTCGGCC CACGGCGCGG AGCGGATCGT CCTGCGCCAC
GTCGGCGACA TGCCGTTCGT GATGGACACC GATATGGCGC GCCGGTTCCA CAAGCGGTTC
GCGGGCGAAC TCGCGCTGTG GCAGGCGCAG CATGGCAGCA AAGCCGAGCA GGATCACCTG
GTGATCGCGG GCTCGTTCGC GCGGCGGCGC GAAGGCACCT TCGACCTGAT CGAGGTCGCA
CTGATGCCGG TGACGCCCGA ATGGCTCCCC TACGAGACCA GCGACGAGCG CTATCTGATC
GCCAAGGCGG TCGCCGAGAA GCGCCGGTTC GTGAAGGGCC TGCGCGTCAA TCTCGACGTC
GACATGCCGA TCGCGAGCCT GGTGCTGAAG GACACTGGCG AGGAAGCCTG CGCGGTGCAT
ATCCATGACC GCGACAACGA AGTGGCCGAG CCGCTCGAGG CGCTGCTCGC CGGGCAAGGC
GTCGCGCACC GGTTCTGGAA GGAAGGCGAG CCGCTCCCGG CGCGCGTCAC GCGGCAACGA
CGCTGGGAAG CGCAGGCCGC AGCCTGA
 
Protein sequence
MIQRYRVLGR SVAEGDELQP LLAKAYAQKA QVLCECRKGT DLPLYISHRN DRYVLARWPG 
SGARHATACD HYEAPDYLTG MGQVRGAAII DDETSGETSL KLGFPLSRGS ARLAPAAITN
DKPTVKSSGQ KLSMRGLLHV LWDRAELTHW HPKMAGKRSW FVVRRALLEA AASCRAKQEA
LPHVLFVPES FKLEEKEDIR ARRRTALERV YRSRDEMMVV VGEIKEIVSA HGAERIVLRH
VGDMPFVMDT DMARRFHKRF AGELALWQAQ HGSKAEQDHL VIAGSFARRR EGTFDLIEVA
LMPVTPEWLP YETSDERYLI AKAVAEKRRF VKGLRVNLDV DMPIASLVLK DTGEEACAVH
IHDRDNEVAE PLEALLAGQG VAHRFWKEGE PLPARVTRQR RWEAQAAA
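
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch, the gene sequence can be cleaned, translated, and its GC fraction computed as shown below. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical and not part of any database tool. The codon table is built from the standard NCBI amino-acid string, whose amino-acid assignments translation table 11 shares with table 1; table 11's alternative start codons are not handled, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq_block: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq_block.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last display lines of the gene sequence in this record.
first = clean("ATGATCCAGC GATACCGTGT GCTCGGCCGC")
last = clean("CGCTGGGAAG CGCAGGCCGC AGCCTGA")
print(translate(first))  # MIQRYRVLGR
print(translate(last))   # RWEAQAAA
```

The two translated fragments agree with the start and end of the protein sequence listed above. Passing the full cleaned gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.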