Gene Saro_2541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2541 
Symbol 
ID3916862 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2742309 
End bp2744999 
Gene Length2691 bp 
Protein Length896 aa 
Translation table11 
GC content65% 
IMG OID640445298 
ProductTonB-dependent receptor 
Protein accessionYP_497811 
Protein GI87200554 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG4206] Outer membrane cobalamin receptor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.411805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAATCCA TTTCTGCCGG TCGCACCGGC CAGGCCGCGC TTCTCGTCGG TGCCTGCGGT 
CTTGCTCTTG CATTCTCGGC GACCGCCCAG GCCCGCACGC AGGACGCCGG GCCCGTCGAA
GAGGCCCAGC CCGCGACCGA TGCGCCCGAG CCGCAGGCCG ACGGAAACGA GATCATCGTC
ACCGCCACCA AGCGCGAGCA GACCCTTCAG GACGTTCCCG TCGCCGTTTC GGTGACCAGC
GCGCGGACGC TCGAGCAGGC CCAGATCCGC GATCTCAAGG ATCTCACCAG CGTCGTGCCC
TCGCTGCGCG TGACCCAGCT CCAGTCGAGC GCGCAGACCA ATTTCATCAT CCGCGGCTTC
GGCAACGGCG CCAACAATGC CGGGATCGAG CCATCGGTCG GCGTGTTCAT CGACGGCGTC
TACCGTTCGC GCTCCGCCGC CCAGATCGGC GATTTCCCCG ACGTTCAGCG CATCGAGGTC
CTGCGCGGCC CGCAGTCGAC GCTGTTCGGC AAGAACGCCA GCGTGGGCGT CATCTCGATC
GTCACGCAGG CGCCCAAGTT CGACTTCGGC GGCAATGTCG AGGCGAGCTA TGGCAATTAC
GACGCGGTCG TCCTCAAGGG CGTGGTCACC GGTCCGATCA CCGACCAGCT CGCAGTCAGT
CTGGCGGGCG GTCTCAATAA GCGGGACGGG TACAACAAGG ACCTCGGCAC CGGCAACCGG
ACCAACGAGC GCGATCGCTG GTTCCTGCGC GGCCAGGCGC TTTGGGAGCC CAATGCCCAG
GCCCGCGTCC GCCTTATCGG CGATTACTCC AAGATCGACG AGAACTGCTG CGGCGTCGTC
AATCTCCAGC CCTCGTCGGC GACGCTGGCG GTCCAGGCGC TGGGTGGCCA GGTCAACGGC
ACCGACGAGA TTTACGCGAA CAAGGTCTAT TCCAATATCG ATTCCACCAA CCGGATCGAG
AACTGGGGCG CTTCCGGCCA GGTCGATTAC GACCTCGGTC CGCTGACCTT CACGTCGATC
ACCGCGTTCC GACGGTCCAA TTCGCTGACG AACCAGGACT CCGACTTTTC CAGCGCCGAC
CTCATCGGCC AGAATACCGC CGACCAGCGC ATCCGCACCT TCACCCAGGA GCTGCGCGTT
GCCACCAACA TGGACGGCCC GCTGAACTTT CTCGTCGGCG GCTACTACTT CAACGAGAAC
ATCCGCCAGA CGGGCCGCAT CGCCTGGGGT GCGGACGCGC GCAGCTATGC CGACTTCATC
ATCCGGGGCC TGACCTCGAA CACCCAGTCG CTGTCCAGCC TGGAGACCAC GTTCAGCGGC
CTGACCGGGA CCAACTACAC CGGGCAGTTC TTCGCCGAAG GGCAGGGCCT CGACGAGCGC
TATCGCCTTA AGAACGAGGC CTACTCCTTC TTCGGCCAGG CGGACTTCAA GATCGGCGAC
CGCCTGACCG TGACTGGCGG CATCAACTAC ACCAACGACA AGAAGCGCTT CGCCGCCTCG
GCCGACACGT CCGACGTGTT CTCCAGCCTC GACCTGGTGG ACATCGGCGG GACCGCGCTG
AGCCGTGGCT TCATTTCGAC CGCGCTCGGT ACGACCGATC CGCTCGCGAT CGCCGCCTTC
GCCAGCGCCA ACCCCGACGT TTACGCCACG CTGCAAACGC AGGCTGCCGG CTTTGCCGCG
CTCAACGCCC GCCTTTCCAC CAGCGACGCG GCGGCAGACG CCAATCCGCT GACCGTCGGC
AATCCGCTGC TTGCGCTCCA GGACCTGCAG TTCCTGCCGC CGTTCCTGGC GGTGCCCAAT
GCTGTCGAGC CGGGCCGGAC CAACGACGAC AAGTTCACTT ATGTCGTCCG CGTCGCCTAT
GACCTGACCG ACAACATCAA CGTCTATGCA AGCTACGCGA CCGGCTTCAA GGCCAGCTCG
ATCAACCTCT CGCGCGACAG CCGTCCGTTC GAGGCCGACC GCGAGCAATT GACCTCGCGC
GGGTTGGCCG TGGTCAACCA GACCTATGGC AGCCGCTTTG CCGGACCCGA AAGTTCGACC
GTCTATGAAA TCGGGCTCAA GGCGGACTGG GGTCTCGTCA CTGCCAACGT CGCCGCGTTC
GACCAGCGCA TCAATGGCTT CCAGTCCAAC ACCTTCACCG GTTCGGGCTT CGTGCTGGCC
AATGCGGGCA AGCAGTCGGT GCGCGGCCTG GAGTTCGAAG GCACGGTCAA GCCGGCCAAG
GGCTGGCTGC TCAATGTCGG CGTGACCTAC CTCGATCCCA AGTACGACAG CTTCGTGCTT
TCGGCGGTGG GCGACCTGTC CAACACGCGC CCGGCCGGCA TTCCGGCGAT TTCGTCGACC
TTCGGCGCCA GCTACGACCA CGAGTTCGCG GGTGGCGACC ACCTGATCCT GCGCGGCGAT
TTCCACTATG AATCGCCGGT TCAGATCGTC GAGGGCCTTC CGGGCTTCCT CGACGCGGGA
ACCAGCATCG CGGTGGCGGC GGCACGGCCG TTCCGGCGCG AGGTGAACGA GGCCAACGCG
TCGATCACCT GGGCGATGCA GATGGGCCTG GAACTGACGG TGTGGGGGCG CAACCTCACC
AACAACCGCT ATCTGCTCTC GGTATTCGAC ACTCCGGCAC AGCCCGGGTC GATCTCGGGC
TATACCAGCC AGCCGCGTAC CTACGGCGTG ACCGGGCGCT TCCGGTTCTG A
 
Protein sequence
MKSISAGRTG QAALLVGACG LALAFSATAQ ARTQDAGPVE EAQPATDAPE PQADGNEIIV 
TATKREQTLQ DVPVAVSVTS ARTLEQAQIR DLKDLTSVVP SLRVTQLQSS AQTNFIIRGF
GNGANNAGIE PSVGVFIDGV YRSRSAAQIG DFPDVQRIEV LRGPQSTLFG KNASVGVISI
VTQAPKFDFG GNVEASYGNY DAVVLKGVVT GPITDQLAVS LAGGLNKRDG YNKDLGTGNR
TNERDRWFLR GQALWEPNAQ ARVRLIGDYS KIDENCCGVV NLQPSSATLA VQALGGQVNG
TDEIYANKVY SNIDSTNRIE NWGASGQVDY DLGPLTFTSI TAFRRSNSLT NQDSDFSSAD
LIGQNTADQR IRTFTQELRV ATNMDGPLNF LVGGYYFNEN IRQTGRIAWG ADARSYADFI
IRGLTSNTQS LSSLETTFSG LTGTNYTGQF FAEGQGLDER YRLKNEAYSF FGQADFKIGD
RLTVTGGINY TNDKKRFAAS ADTSDVFSSL DLVDIGGTAL SRGFISTALG TTDPLAIAAF
ASANPDVYAT LQTQAAGFAA LNARLSTSDA AADANPLTVG NPLLALQDLQ FLPPFLAVPN
AVEPGRTNDD KFTYVVRVAY DLTDNINVYA SYATGFKASS INLSRDSRPF EADREQLTSR
GLAVVNQTYG SRFAGPESST VYEIGLKADW GLVTANVAAF DQRINGFQSN TFTGSGFVLA
NAGKQSVRGL EFEGTVKPAK GWLLNVGVTY LDPKYDSFVL SAVGDLSNTR PAGIPAISST
FGASYDHEFA GGDHLILRGD FHYESPVQIV EGLPGFLDAG TSIAVAAARP FRREVNEANA
SITWAMQMGL ELTVWGRNLT NNRYLLSVFD TPAQPGSISG YTSQPRTYGV TGRFRF