Gene Saro_3562 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3562 
Symbol 
ID5077711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp179047 
End bp181290 
Gene Length2244 bp 
Protein Length747 aa 
Translation table11 
GC content64% 
IMG OID640481286 
ProductTonB-dependent receptor, plug 
Protein accessionYP_001165948 
Protein GI146275788 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000000379624 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAACA CCAGAAAGAT CGCATTCGCT TGCATCGCTT CGACGATCGC GCTTGCGCAC 
CCGGCCTTCG CACAGGACGC TGCCGATCAG CCCAACGACG GCGACATCAT CGTCACGGCC
AACCGCACCT CGTCGCTGCT GTCCAAGACC CCGATCGCGA TGACCGCGGT CGCCGGCGAC
GACCTGATCC AGTCGGGCAT CACCAACCCG ACCCAGCTTG AAGAGACCGT TCCGAACCTG
TCGATCGTGC GCGGCAATGG CCTGCAGATC ACGATCCGCG GCGTGACCAG CACCGACGGC
ACCGAAAAGG GCGATCCCTC GGCGGCGTTC ATGGTCAACG GCGTCTACCT TGCCCGTCCC
CAGGCGCAGG AAGTCTCGTT CTTCGACATC GAACGCATCG AAGTTCTGCG CGGTCCCCAG
GGCACACTCT ATGGCCGCAA CTCGACTGCT GGCGTCGTCA ACATCCTGAC CGTCCAGCCC
AAGTTCGAAT TCGGCGCGCG CGCCGACGTG TCCTATGGCA ACTACGACGC GCTCAATGGC
ACAGTCGCCA TCAACCTGCC GGCGAGCGAG AGCATCGCCT TCCGCGTTGC CGCCAACATC
GACCGTCGCG ACAGCTACCT GATCGACGGC AATTCTGCCG ACGGCATCGG TATCGGCCGC
TTCAAGGACA ACAAGGCGGT CCGTCTCTCG GCCCTGTTCA AGCCGACCCC CGATCTCAGC
CTGCTGCTCG TCGGCGACTA TAGCTGGCAG AAGGGCAGCC CGACCAACGG CGTCGAGACC
TCGACTTTCT TCTCCGACAT CACCAGCGGT GGCCGCCCGA CGTTCGAGCG CCCGACCTAT
CTTGACCCGT CGGCCCGCGC CGGGCGCACC CTTCTTGCCA CTCAGGCGCA GTACGCATTC
CGCGATAGCA CCGACCGTGG CGTGATGGGG CAGCTCGATT ACACCATGGG CAACGTCACG
CTGACCTATG TCGGGTCCTA TCGCGAATCC GACCGCAAGG AATTCAGCAA TGTCGGCACC
CTGCCGATCA GCGCAGACTT CTACGGCAGC TATTGGCAGA CCTCGCAGGA AGTGCGCCTT
GCCTATGGCG GCGACGGCCC GCTCCAGGCG CAGGTCGGCG GCTACTACTT CAAGGAGAAG
TCGGGCATCG CCTTCTTCAT CAACAACCTG CTGGGCGCCA ACACCCGCTT CGGCTTCCCG
CAGGACCCGA CCATTGCCGA AAACAAGTCG GTGTTCGGTC AGGCGACCTA TGAAATCGCG
CAGGACGTGA AGCTGACCGG CGGTGTCCGC TATTCGCACG ACCTCAAGTC GCGCGTGGGC
GCAACGGTGC TCGATTCCTA TTCGTCGGTC GTGGACTCCT CGAACATCGG CGAGTTCCTG
GACCGCACCA CGTTCCAGGT GAATGACGCG AAGCGCAATT TCTCCAAGGT CACCTGGCGC
GCGGGCATCG ACTACGACAG CCCGCTGGGC CTGATCTACG CCTCGGTCTC GACCGGCTAC
AAGGCAGGCG GCTTCAACGA CGGCTGCGAA GTCGGCAAGG GCGACAACTG CGCCCTCGCG
GCCGGCGACC TCTACTACCA GCCCGAGGAG CTGACCGCCT ACGAAGCCGG CTTCAAGTTC
CGCATCAGCC CCGAGTTCCG CCTGAACGCG ACCGTGTTCC ACTACGACTA CAAGGGCCTG
CAGCTTTCGC AGGTGTCCAA TGCCTGCGGC GGTCCTTGCC AGGTCACTTC GAACGCGGCC
AAGGCCAAGG TCGATGGCGT GGAGCTTGAT GCCACGATCC AGCCGGTGGA CAACTTCACC
GTTCGTCTCG CGCTCAACTA CCTCGACGCG CGCTACGGAC AGTTCACGCC CAGCTACGAG
GACGAAGATG CCGAGGGCGG CTTCAGCTAC GTCAATTTCG CGGGCCGCGC GCTCAACCGC
AGCCCCAAGT GGAGCTGGGT TGCCGGCGTG AACTACGTGG TGCCGGTGGG CGAAGGCCGG
ATCGTGCTGG ATGCGCAGAC CGCGGCGCGC AGCAAGTACG AGCTTACCGA CCTTGCCAAC
TACGCCTACT TCTACCAGCC CGGTTTCTCG AAGACCGACG CGTCGATCAC CTACAACGCT
CCGCAGGACC GTTTCTATCT CGCCGCCTTC GTCGAGAACC TCGAGAACAA TCTTGTCCTG
ACCGGCGCGA CGACCGGCAC GCTCGGTTCG GTCACGTTCT CCGACCCGCG TACCTTCGGC
GTGCGCGCGG GCGTGAAGTT CTGA
 
Protein sequence
MMNTRKIAFA CIASTIALAH PAFAQDAADQ PNDGDIIVTA NRTSSLLSKT PIAMTAVAGD 
DLIQSGITNP TQLEETVPNL SIVRGNGLQI TIRGVTSTDG TEKGDPSAAF MVNGVYLARP
QAQEVSFFDI ERIEVLRGPQ GTLYGRNSTA GVVNILTVQP KFEFGARADV SYGNYDALNG
TVAINLPASE SIAFRVAANI DRRDSYLIDG NSADGIGIGR FKDNKAVRLS ALFKPTPDLS
LLLVGDYSWQ KGSPTNGVET STFFSDITSG GRPTFERPTY LDPSARAGRT LLATQAQYAF
RDSTDRGVMG QLDYTMGNVT LTYVGSYRES DRKEFSNVGT LPISADFYGS YWQTSQEVRL
AYGGDGPLQA QVGGYYFKEK SGIAFFINNL LGANTRFGFP QDPTIAENKS VFGQATYEIA
QDVKLTGGVR YSHDLKSRVG ATVLDSYSSV VDSSNIGEFL DRTTFQVNDA KRNFSKVTWR
AGIDYDSPLG LIYASVSTGY KAGGFNDGCE VGKGDNCALA AGDLYYQPEE LTAYEAGFKF
RISPEFRLNA TVFHYDYKGL QLSQVSNACG GPCQVTSNAA KAKVDGVELD ATIQPVDNFT
VRLALNYLDA RYGQFTPSYE DEDAEGGFSY VNFAGRALNR SPKWSWVAGV NYVVPVGEGR
IVLDAQTAAR SKYELTDLAN YAYFYQPGFS KTDASITYNA PQDRFYLAAF VENLENNLVL
TGATTGTLGS VTFSDPRTFG VRAGVKF