Gene Saro_3525 details

Gene Information

Locus tag: Saro_3525
Symbol:
ID: 5077674
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 139955
End bp: 142255
Gene Length: 2301 bp
Protein Length: 766 aa
Translation table: 11
GC content: 66%
IMG OID: 640481249
Product: TonB-dependent receptor
Protein accession: YP_001165911
Protein GI: 146275751
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
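
The coordinate and length fields in the table can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon. The variable names are made up for this example and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 139955
end_bp = 142255
gene_length_bp = 2301
protein_length_aa = 766

# Assumption: 1-based inclusive coordinates, so length = end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS contains one stop codon that is not translated,
# so the protein is one residue shorter than the codon count.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_length == protein_length_aa)
```

Under these assumptions, all three checks agree with the values reported in the record.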


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGTATCG TCGCCGTGCT GCTCGCCAGC ACTTCGCTTG TCGCCGCCCC GGCCTTCGCC 
GCCGAAGAAG CCGCACCTGC ACCGGTCGCC CCTGAAGCCA TTTCCGATGC GGCCGCCGCC
GAAATCGTCG TCATGGGCCA GGGACAGACC CGGCAGGTGC AGGAACTTTC CACGCAGGAG
CTGACCATTC TCGCCTCGGG CACCAGCCCC TTGAAGGCGA TCGAGAAGCT GCCCAGCGTC
AATTTCCAGT CCGCCGACGC CTTCGGCACC TATGAATGGT CGACCCGCGT CACCATTCGC
GGCTTCAGCC AGAACCAGCT TGGCTTCAAC ATCGACGGCA TCCCGCTGGG CGACATGTCC
TATGGCAACG CCAACGGCCT GCACATCAGC CGCGCGATCA GCCCCGAGAA CATCGGCGTC
ACCCGCGTCA GCCAGGGCTC CGGCTCGATC ACCGCGCAGT CGACCAACAA TCTCGGCGGC
ACGCTCGAGT TCTTCTCGAT CGATCCCAAG GACGCCCTCG GCGTTACCGC CAGCGCATCC
TACGGTTCGG AAAACACCTG GCGCGGCTTT GCGCGCATTG GCCTGGGCAC CACCGATGGC
GCCCGCGCCT TCGCTTCGGT CCAGTACCAG GACGGCGAAA AGTGGAAGGG CGACGGCAAG
CAGCGCACGC TGATGGTCAA CGCCAAGGGC GTGCTCCCGC TGGGCGGCGG TACCGAACTC
GACGGCTACG TCAGCTATTC CGACCGCGCC GAGCAGGACT ACCAGGATCT CAGCCTCGCG
ATGATCCAGC GTCTCGGCTA TGACTGGGAC AACTTCGGCC CCTCGCGCTA TGCCGAAGCG
GTCCGCGTGG CCGACATCGC CGCCAACCGC GGCGACACCG GCACCGCCCC GCTCAACGCC
GCGGCGGGCA CCACCTACCC CAGCCCGATC GCTTCGGCGG ACGATGCCTA TTACGATGCC
TCGGGCCTGC GCAAGGACAC CCTCGCCTCG CTCGGCCTCA CCACGCCGCT GGGCGATGCG
CTGACCTTCA AGGTCAAGGG CTACTACCAC GAGAACGACG GCCAGGGCAC GTGGGGCAGC
CCATACGTCA ACAGCCCCAC CGGCGTGCCG ATGGCGCTGC GCACCACCGA ATACGACATC
AAGCGCAAGG GCGTGTTCGC CGCGCTTTCG GGCACGTTCG GCATCAACGA ACTCACCGTC
GGCGGCTGGT ACGAGAAGAA CGACTTCATC CAGTCGCGCA AGTTCTATGC CTATGAGAGC
CGGACCAACC CCGGCCGCGA CCACCTGAAG TTTCAGCACA ACCCGTTCTA CACGCAGTGG
TCCATCGCCT TCGAGACCGA CACGCTGCAA TACTACGTCT CCGACGACGT CGACCTCGGC
GATCTCAAGG TGAACCTTGG CTGGAAGGGC TATTCGGTCG ACACCAACGC GTTCGCGCTG
GTCAACGTCA GCGGCCTCGC CACCGGCGAC ATCAAGGTCG AGGACTGGTT CCAGCCGCAC
GTCGGCCTGA ACTACAAGCT TGGTGACGGG CTCGAGGCTT TCGCGGGCTT CACCCAGGTG
ACGCGCGCCT ACCAGGCATC GGCCACCAGC GGCCCGTTCT CGACCACGCA GGCCGGCTTC
AATGCGATCA AGGACAAACT CAAGCCCGAA AGCTCGGACA CCTGGGAGGC GGGCCTGCGC
TACAACACCG GCGTCATCAA CGCCTCGCTC GCAGGCTACT ACGTCAACTT CCGCGACAGG
CTGCTGGTGA TCCCGACTTC GGTCGGCGTC GTCGGCTCGG CCAACGTGCT GCAGAACGTC
GGCTCGGTCC GCGCACTCGG CATCGAGGCG GCGGTGGACG TGAAGCTCCC CGGCGGCTTC
GGCGCGTTCG CTTCGTACAG CTACAACGAC ACGACCTACC GCGATGACGT GACCATCACC
GCGGGCGGCA CCACGGTGGT CCGCGCGACC GCTGGCAAGA CCGTCGTCGA CACGCCAAAG
CATCTCCTGC GCGGCGAACT GTCGTATGAC AGCCAGACCG TGTTCGGCCG CGTCGGGGTC
AACTACATGT CCAAGCGCTA CTTCACCTAC CTCAACGACC AGTCGGTCCC CGGCCGCGCG
CTGGTGGACG CGACCATCGG CTACCGCCTC GACATCGGCC AGCGCCAGCC GGTCGAACTG
CAGCTCAATG CCGTGAACCT GTTCGACAAG CGCTACGTCG CCACGATCGG GTCCAACGGC
TTCGGCTTCA GCGGCGACAA CCAGACCCTG CTCGCGGGCG CACCGCGCCA GGTCTTCGTC
ACGCTCAAGG CGGGGTTCTG A
 
Protein sequence
MRIVAVLLAS TSLVAAPAFA AEEAAPAPVA PEAISDAAAA EIVVMGQGQT RQVQELSTQE 
LTILASGTSP LKAIEKLPSV NFQSADAFGT YEWSTRVTIR GFSQNQLGFN IDGIPLGDMS
YGNANGLHIS RAISPENIGV TRVSQGSGSI TAQSTNNLGG TLEFFSIDPK DALGVTASAS
YGSENTWRGF ARIGLGTTDG ARAFASVQYQ DGEKWKGDGK QRTLMVNAKG VLPLGGGTEL
DGYVSYSDRA EQDYQDLSLA MIQRLGYDWD NFGPSRYAEA VRVADIAANR GDTGTAPLNA
AAGTTYPSPI ASADDAYYDA SGLRKDTLAS LGLTTPLGDA LTFKVKGYYH ENDGQGTWGS
PYVNSPTGVP MALRTTEYDI KRKGVFAALS GTFGINELTV GGWYEKNDFI QSRKFYAYES
RTNPGRDHLK FQHNPFYTQW SIAFETDTLQ YYVSDDVDLG DLKVNLGWKG YSVDTNAFAL
VNVSGLATGD IKVEDWFQPH VGLNYKLGDG LEAFAGFTQV TRAYQASATS GPFSTTQAGF
NAIKDKLKPE SSDTWEAGLR YNTGVINASL AGYYVNFRDR LLVIPTSVGV VGSANVLQNV
GSVRALGIEA AVDVKLPGGF GAFASYSYND TTYRDDVTIT AGGTTVVRAT AGKTVVDTPK
HLLRGELSYD SQTVFGRVGV NYMSKRYFTY LNDQSVPGRA LVDATIGYRL DIGQRQPVEL
QLNAVNLFDK RYVATIGSNG FGFSGDNQTL LAGAPRQVFV TLKAGF
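
Both sequences are printed in space-separated blocks of 10 residues, so they need to be cleaned before any downstream use. The following is a minimal sketch of that step, not the database's own tooling; `clean_sequence` and `gc_fraction` are hypothetical helper names introduced here. The sample uses only the first and last lines of the gene sequence, so it does not reproduce the GC content reported for the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group residues, and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string (0.0 for empty input)."""
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, used as a short sample.
first_line = "ATGCGTATCG TCGCCGTGCT GCTCGCCAGC ACTTCGCTTG TCGCCGCCCC GGCCTTCGCC"
last_line = "ACGCTCAAGG CGGGGTTCTG A"

head = clean_sequence(first_line)
tail = clean_sequence(last_line)

print("bases on first line:", len(head))
print("bases on last line:", len(tail))
print("first codon:", head[:3])
print("last codon:", tail[-3:])
print("GC fraction of first line:", round(gc_fraction(head), 2))
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` would give a value to compare against the GC content field in the Gene Information section.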