Gene Saro_3520 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3520 
Symbol 
ID5077669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp131472 
End bp134345 
Gene Length2874 bp 
Protein Length957 aa 
Translation table11 
GC content65% 
IMG OID640481244 
ProductTonB-dependent receptor 
Protein accessionYP_001165906 
Protein GI146275746 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTAGTC ACGTTTCGCG CGTTTCGCTT TTGAAGATGG CCCTGCTGGC CGGTGGCGCA 
TTTGCCGCGC CGAGCCTTGC TTTCGCCCAG GACGCGACCG CCGCTCCCGA ACCCCAGGCC
GTAGCCGAAG ATGCAGCGCC CCCGACCGAC GCCATCGTCG TCACCGGCTT CCGCCAGTCG
CTCCAGGCTG CGATCAACGT CAAGAAGAAC GCCGTCGGCG CGGTCGACGC CATCGTTGCC
GAAGACATTG CAAAGTTCCC CGACCAGAAC CTTGCCGAAT CGCTACAGCG CATTCCCGGC
ATCTCGATCT CGCGCGATGC CGGCGAAGGC CGCGCGATCA CCGTGCGCGG TCTGTCCAGC
CAGTTCACCC GCGTCCGCGT CAACGGCATG GAAACGGTCG CCACCTCGAC CGACGGCGCC
TCGGCGAACC GCGACCGCGC GTTCGACTTC AACGTCTTCG CTTCGGAACT GTTCAGCTCG
ATCGTCGTGC ACAAGACCGC CGAAGCCAGC CTCGACGAAG GCTCGCTCGG CGCGGTGGTG
GATCTCAACA CCGGCAATCC GCTGGGTGGC AAGGCCGGCT TCACCGGCGT CGCATCGGTC
GTCGGCACTT ACAACGACCT GTCGGACTAC GTCGGCCCGC GCCTTGCCGG GCTGCTGAGC
TGGCGCAACG ACGCCGGCAC TTTCGGCATC TCGCTGTCCG CCGCGTACCA GAAGACCCGC
GTGCTGGAGC TTGGCAACAA CTCGGTCCGC TGGGCCCAGG CGCGCTTCGA CTCGGTCGAT
GGGACCCCGT GCTTCACCCG TCCCAACAGC GGCGGCACTT ACGTCGAGAG CGATGCCTGC
GACGAGGCCG CCCTCGCCTT CCACCCGCGC ATCCCGCGCT ACGGCGAAGT GAAGCATGAC
CGCGAGCGCC TCGGCATCAC CGGCTCGGTC CAGTTCGCGC CAACCGATGC GACGAAGTTC
TCGATCGACG GCCTGTTCTC CCGCTTCGAC GCGAAGCGCG AGGAGCAGTG GGGCGAAGTC
CTGCTGCGCT CCAACGAGCG TTCGATCGAC GTGGTCGACC CGGTCTACGA CGACAACGGC
AACATGGTCT CGGCCACGCT GAACGATGCC TGGGTGCGTA CCGAGCACTA CCTGCGCAAG
AGCCGCACCG AATTCTATCA GGTGGGCGGC ACCTGGGACC AGGACATCGG CGACAGCCTG
CGCTTCACCC TGCTCGGCGG CTTCTCCAAG TCGAATGCCG AAATCCCGGT CGAGACCACG
ATGATCTTCG ACGACCGCGA TGCCCAGGGC TATTCGTTCG ACTACACCGA CATGAAGCAC
CCGAAGCTGA CCTTCGGCAC CAGCGTCACC GATCCGGCGA ACTTCCAGCT CGCCGAAATC
CGCGACCGCC CGTCGAGCGT CGTGAACCGC TTCAAGACCG TGCAGCTTCG TACCGAATGG
GACGTGGCCG AAGGCTTCAC GATCAAGGCG GGCGGCATGT GGCGCCGGTT CAACTTCGAT
ACCGAAGCCT TTACCCGCGA CACCGCGGTC TGCGGCAACG GCGGCGTCGA CCGCATCTTC
GGCACGATCA ATTGCTCCGC CTCCTCGGTG TTCGGCCCCA CCGCTGTCTA TGGCATCCCG
GTGACGGCGG CGCTTGGCCA GGTGTTCAAC CTCGGCAACG CGGGTCAGCC CGCGGGCAAC
ACCAATTCGT GGCTGGTGCC CAATCTCGAC GCCACGACCG CGCTGACCAA GCTCTACGAA
CGCCCGCTGG CTCTCGATGC CGGCAACAAC CGGGGCGTGC AGGAGACCAC CAAGGGCGGC
TATCTCCAGT TCGACGCCAA GGGTGAGCTG CTGGGCCTGC GCTATGCGCT CAATGCAGGC
ATGCGGTACG TCAAGACCGA CCAGTCCTCC TACGGTCTCG TCGGCTCGGT GCAGACCACG
GTCAAGCGCA GCTACGAAGA CTGGCTGCCC TCGGCCAACC TCGCGCTCTA CCCCACCGAG
AACGTGATCG TGCGCGCCGC CATCGCCGAC GTGATGACCC GCCCGACGCT CGGTTCGCTG
ACGCCCGGCG GCTCGGCCGA CGGCTTCAAC TACCGCGTCA GCACCGGCAA CCCCTATCTG
GAACCCTATC GCGCCACCAA CTACGACCTC GGCGTGGAAT GGTACTTCGC TCCCCAGGCC
GTGCTTTCGG CGGCATGGTT CAAGAAGGCG GTGCACACCT TCACCCGCAG CGCCTCGATC
ACCGGTCTGA CCTACGCCCA GACCGGCGCG CCGATTTCCT CGCTCAGCCC GAACTCGCCG
GCTGCGCTGA ACCCGTCGCA GTTGCTCGAG GATCGCTGGA CGCTGGCAAC CACGGTCAAC
GGCGAGGGTG CCACGCTCAA GGGCTGGGAA TTCGCCGCGC AGGTGCCCTT CAAGATCTTT
GCCGAAGGCT TCCTCGGGAA CTTCGGCGTG ATCGCCAACG CCACCTTCAT CTCGTCGAGC
GCGGAATACG AACTCCAGGG CCCGATCACC GTCGCCCGCC TGCCCAACGG CAACCTCGGC
CCGCTCAACA ACGTGACGCT CACCAGCACG CTCGCGAACG TGTCGAAGCG CGCCTACAAC
GGCACGCTCT ACTACGATGA CGGCCGCTTC TCCGCGCGCG TCATGGGCAC CTACCGCAGC
GCCTACCACG AAGGCGCCAG CGGCACCGGC AACCTGCTCG AAGGCTACGG CTCGATGTGG
AACGTCGACG CTTCGGTTCG CTACAAGGTG AACGACTGGC TGGAAGTCTC GGTCGAGGGC
AACAACCTCC TCGACACCTA TCGCTACCGC TACACCGACA TAGAAACGCA GCGGAATTAC
GAGAACAATC ACTTCGGGCG CAACATCCTG GTGGGCGCGC GCCTCAAGTA CTGA
 
Protein sequence
MPSHVSRVSL LKMALLAGGA FAAPSLAFAQ DATAAPEPQA VAEDAAPPTD AIVVTGFRQS 
LQAAINVKKN AVGAVDAIVA EDIAKFPDQN LAESLQRIPG ISISRDAGEG RAITVRGLSS
QFTRVRVNGM ETVATSTDGA SANRDRAFDF NVFASELFSS IVVHKTAEAS LDEGSLGAVV
DLNTGNPLGG KAGFTGVASV VGTYNDLSDY VGPRLAGLLS WRNDAGTFGI SLSAAYQKTR
VLELGNNSVR WAQARFDSVD GTPCFTRPNS GGTYVESDAC DEAALAFHPR IPRYGEVKHD
RERLGITGSV QFAPTDATKF SIDGLFSRFD AKREEQWGEV LLRSNERSID VVDPVYDDNG
NMVSATLNDA WVRTEHYLRK SRTEFYQVGG TWDQDIGDSL RFTLLGGFSK SNAEIPVETT
MIFDDRDAQG YSFDYTDMKH PKLTFGTSVT DPANFQLAEI RDRPSSVVNR FKTVQLRTEW
DVAEGFTIKA GGMWRRFNFD TEAFTRDTAV CGNGGVDRIF GTINCSASSV FGPTAVYGIP
VTAALGQVFN LGNAGQPAGN TNSWLVPNLD ATTALTKLYE RPLALDAGNN RGVQETTKGG
YLQFDAKGEL LGLRYALNAG MRYVKTDQSS YGLVGSVQTT VKRSYEDWLP SANLALYPTE
NVIVRAAIAD VMTRPTLGSL TPGGSADGFN YRVSTGNPYL EPYRATNYDL GVEWYFAPQA
VLSAAWFKKA VHTFTRSASI TGLTYAQTGA PISSLSPNSP AALNPSQLLE DRWTLATTVN
GEGATLKGWE FAAQVPFKIF AEGFLGNFGV IANATFISSS AEYELQGPIT VARLPNGNLG
PLNNVTLTST LANVSKRAYN GTLYYDDGRF SARVMGTYRS AYHEGASGTG NLLEGYGSMW
NVDASVRYKV NDWLEVSVEG NNLLDTYRYR YTDIETQRNY ENNHFGRNIL VGARLKY