Gene Saro_3785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3785 
Symbol 
ID5077933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009427 
Strand
Start bp427226 
End bp429472 
Gene Length2247 bp 
Protein Length748 aa 
Translation table11 
GC content65% 
IMG OID640481508 
ProductTonB-dependent receptor 
Protein accessionYP_001166170 
Protein GI146276010 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.440403 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCGCGA CACAACACCG CGTTCGCGCG CACGTTGCAC TAGGTGCGGG ACTTGCCGCA 
CTTTCATCCA CGGTCGCCGT GGCCCAGCCC CCCGGCGAAA CGATTGAAGG CGCTGGTGAA
ATCGTCATCA CCGCACAGAA ACGGCCGGAA CTCGCAGGCG AAGTGCCGCT GTCCATTTCG
GTGATCGACG GCGAGACCTT GCAGGCTGCC CGGCTCAGTC AGGCCGACGA TATCGCGGCA
TTGGTCCCCA ATCTCCGTTT CAGCGCAACG GTCGGGGAAA ACACGCCGAT CTTCGCGCTG
CGCGGAGTGT CCATGTCGGA CTTCAGCCTC AACCAGGCCG GACCGGTCGC GACCTATTAC
GACGAGGTCT ACAAGGGCAA CTTCGCCTTC CTTGGGGTCC AGCTTTATGA CCTTGCACGG
ATCGAGGTCC TGCGCGGACC GCAGGGCACG CTCTACGGCA AGAACACCAC TGGCGGCGCG
ATCAACTACC TTGCCGAACG GCCACGGTTC GAGAACGGGG GGTACCTGAA AGCCGGCATC
GGAAACTTCG GGAGAGCCGA AGGCCAGGGC GCGTTGAACC TCGCCATCTC GTCCACGCTG
GCCGCGCGCA TCGCCTTCAC CGCCGCGCGC GCCGACGGTT GGTTCCGCAA CCGGCTGGCG
GGTAGCCCGA ACCTTTCAGC CACCCGCGAG TACGGCGTGC GCGGCTCGAT CCTGTGGAAG
CCGTCGGATA GCGCCGAACT GGTCCTGCGC CTGTCGACGA GCCTCCAGAC GCCGCAGAAC
TACGGCACCT ATTCGGTACC GGGGCCCGGC GGCACAGGCG CGGGCGTCTA TGAAGCATAC
GGTCGGGGCA TGAGCTACTT TCGCACCGGC ATCGGCAAGC GGGAAATCGA AGCGAACTTC
ACGCCGCGTC GCCGCGCCCG CACATGGTCG GCGGCATTGA CCGGCACGTT CCGGCTGAGC
GACAACCTGT CGCTCGTGTC CGTGACGGGA TGGGACCGCG GCAGCCTCTT CGTGCCGGAG
GACACGGATG GCAGCCCGAC CCGGACCCTC GAAATTCCCT ACACGGACCG CGGCACGCAG
TTCGGGCAGG AACTCCGCCT TGCATACGAC GGCGATGGAG CGCTGAGCCT GATCCTGGGC
TTGCACCACC ATCGCGAGGA CCTGTTCAAT GCGACCGACC TGAACTTCTG GACCGACCTC
GACGTCGATG GCAACGGGCG GGTCGACGTT GATGACTGCT CCGCGAACGC CAGCCTTATG
GCCTGTGCCA TTTCCAACCG CTTCGACCAG CGCAAGCGGA GCTGGGCACT GTTCGGCGAT
GCGCACATGA AGCTCGGTAC CAGGACGGGC CTGCGGGGCG GGCTGCGCTT CACCCGGGAC
ATCGGGCTGC AGGCGGGGCT GACGTCGCAA TTACGCGGCG TCGATGGCGT GCTTGTCGCA
ACGCCCATCC TCCCGCTCGA CCGCAGCTTT GCGGGCAGCA ACCTTTCCGG CAAGATCGGT
ATCGACCACA AGCTGGCCGA TGGCACCATG TTCTTCGCCA GCTACAGCCG GGGCTACCGG
GCAAGCGGGT TCAATGCGCA GGCTTTCTTC GATGCCGCCG AAGCCGGTGT GGCCCGACCC
GAGACGATCG ATGCGTTGGA AGCTGGCGCC AAGACGCGAT TGGCCGGTAA CGCGCTGGCG
GTCGCGGTGA CCGGCTTTCA CTACATCTAC CGCAACCAGC AGTTCCTTTC GGTCGACCCT
GCCGATGCGA CGCAGACACT CGTCAATCTC GACCGGTCGC GCATCTATGG GGCCGAGATA
GAGCTGGAGG CTCGGCCCAC GTCCGAGATT GCGGCTCAGA TCGGCGTCGG CATCCTGCAT
GCGCGGGCAA CGCAAGGCAT GATCGGCGGC CTCGACGTGA GCGGCCACAG CCTTTCCAAC
GCGCCCTCGC TTACTCTCAA CGCCGCAGTG GCTGCGACGA TATGGGAACG CGGGCCGGCG
CGCATGGCGC TGCGCGGGGA CGCCAGCTAC ACGTCCTCGC AGTTCTTCGA GATCGTCAAC
ATCCCCCGCC TGCGTCAGCC CGGATATGCG CTGCTTGGCG CAAGTGTCGA TTACGCGCGC
GGTCCGATGA TCCTATCGAT CTGGGGCAAG AACCTTGGCG ACAAGGTCTA TTTCACCTCG
AGCATCGATC TTTCTGGATT CGGCTTCGAT TACAATCACG TGGGGACACC CCGCACCTAT
GGAGCGACCG CCAGGGTCAG CTTCTAG
 
Protein sequence
MSATQHRVRA HVALGAGLAA LSSTVAVAQP PGETIEGAGE IVITAQKRPE LAGEVPLSIS 
VIDGETLQAA RLSQADDIAA LVPNLRFSAT VGENTPIFAL RGVSMSDFSL NQAGPVATYY
DEVYKGNFAF LGVQLYDLAR IEVLRGPQGT LYGKNTTGGA INYLAERPRF ENGGYLKAGI
GNFGRAEGQG ALNLAISSTL AARIAFTAAR ADGWFRNRLA GSPNLSATRE YGVRGSILWK
PSDSAELVLR LSTSLQTPQN YGTYSVPGPG GTGAGVYEAY GRGMSYFRTG IGKREIEANF
TPRRRARTWS AALTGTFRLS DNLSLVSVTG WDRGSLFVPE DTDGSPTRTL EIPYTDRGTQ
FGQELRLAYD GDGALSLILG LHHHREDLFN ATDLNFWTDL DVDGNGRVDV DDCSANASLM
ACAISNRFDQ RKRSWALFGD AHMKLGTRTG LRGGLRFTRD IGLQAGLTSQ LRGVDGVLVA
TPILPLDRSF AGSNLSGKIG IDHKLADGTM FFASYSRGYR ASGFNAQAFF DAAEAGVARP
ETIDALEAGA KTRLAGNALA VAVTGFHYIY RNQQFLSVDP ADATQTLVNL DRSRIYGAEI
ELEARPTSEI AAQIGVGILH ARATQGMIGG LDVSGHSLSN APSLTLNAAV AATIWERGPA
RMALRGDASY TSSQFFEIVN IPRLRQPGYA LLGASVDYAR GPMILSIWGK NLGDKVYFTS
SIDLSGFGFD YNHVGTPRTY GATARVSF