Gene Saro_1868 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1868 
Symbol 
ID3917089 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1969087 
End bp1971690 
Gene Length2604 bp 
Protein Length867 aa 
Translation table11 
GC content64% 
IMG OID640444612 
ProductTonB-dependent receptor 
Protein accessionYP_497142 
Protein GI87199885 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.215038 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGGCG CCAGCAGCAA GAATCAACAG GGAGAGACCA ACATGAGATT TGCATCGATC 
TCCATCGCAG CGCTCGCGAC AGCGATCGCC GCACCTGCCT TTGCGCAGGA CCAGGCCCAG
GCCGACGACA CCCGCAGCGG TGGCATCGCC GAGATCGTCG TGACCGCGCA GAAGCGCGCG
GAAAACGTCC AGGACGTGCC GATCGCCATC ACCGCGTTTA CCGCCGGTGC ACTGCAGGAA
CGCGCCGTGG GCGACGTTTC GGCGCTTTCC GGCATCACCC CCAACGTGAC GCTCGACGCA
TCGACCCCGT TCTCCGGTTC GAGCGCGGTG CTGGGTGCGA CCATCCGCGG CATCGGTTCG
TCGGACTTCG CCTTCAACAT CGACCCCGCT GTCGGCGTTT ATCTTGACGG CGTCTACCTT
GGCCGTTCGA TCGGCGCGAA CCAGGACCTG CTCGACGTCG AGCGCATCGA AGTCCTGAAG
GGCCCGCAGG GCACGCTGTT CGGCCGCAAC ACCATCGGCG GCGCGATCTC GATCGTGACC
CACAACCCCG GTGACGAATT CCATGCCAAG GGCGACGTCA CGGTTGGCCG TTTCAACCGC
ATCCAGGCGC GCGGGCTGGT CGACATTCCG CTGGCTCCGG GCCTCAGCTC GTCGGTTGCC
TTCGGTCTGC ACAAGCGCGA CGGCTTCCAG AAGCGCGTCG CCTATTCCGA TCCCGGCGCG
AACGACAGCT TCACACTGTT CCCGGCTTCG GGCTACGAAA CCCGCAGCCG CCAGGGGGGC
GACAATTCGT GGAACCTGCG CGGCAAGCTG CGCTGGGACG ACGGCGGCAA GTTCCGCGCC
ACCTTCAGCG CCGACTATAC CAACATCGAC CAGGATTCGA CGGCCAACAC CGTGCTTGCC
GTCACCCCGA TCCCGGGGCC GTTCGCGGGC GTTGCCGAGA ACAACATTCC GGGCACCGCG
CTTGACGTCG TCACCGGCAG CTCGGGCTTC CTGTTCGCAG GTCTCTACAA CTTCTGCATT
GGAGCCACCG CACAGCAGAT CGCCGACCGC AATGCGACCA ACCTGTGCGG CCCGCGTTCG
AGCGTCAACG GCTACCTGAC CCTGCCGGGT CTCGCCAGCC GCAACGTCGA CGGTGACCCG
TACAACGACC TGCTGCCTTA CGACGGCCGC TGGGTGAACA CCGACAAGGA CGTCAGCTAC
GCCAACGGCA ACAACTTCTC GAAGCTGAAG CAGTGGGGCC TTGGTCTCAA CCTCGAGTAC
GACCTGACTG ACAACATCGC GCTGAAGTCG ATCACGTCGT ATCGCGAAGT GGACTTCAAG
GCAGGCGTCG ACCTCGACAA CTCGCCGCTG CCGATCCTCC AGACGAGCTT CATCGTCGAC
CAGTACCAGT TCAGCCAGGA AGTCCAGCTT ACCGGTTCGG CAATGGACGG CGCCCTGAAC
TTCGTGCTCG GCGGCTATGG CTTCAAGGAA AACGGCGACC TGCGCGACTT CGTGACCTTC
TCGGCCGGCC TGCTGCAGGT TGACGGTCCG GGCAAGGTCG ACACCGAGGC CTATGCCGGG
TTCGGTCAGG TCGACTGGCG CGTGAACGAC CTCATCGGTA TCTCGGTGGG CGCGCGCTAC
ACCAAGGAGA ACAAGCGCTA CGACGGTGCG CAGTCGGACA TCAACGGCTT CAACTACAAG
CTGTTCAACT GCATGGCGCT GGACCCGGCG ACCGGCAACC CGAGCGCGGA ATGCGCCGCG
GGCGTCGGCT TCCCGATCCC GTCGGAACCG TTCCGCTACT ATCCAACTTC GCCGAACAAG
CAGACCTTCG ACGACTTCTC GTACAAGCTG GGCCTCCAGC TTCACCCGAC CGAAGACGTC
ATGGCCTATG GCTCGTTCTC GCGCGGGTAC AAGACGGGTG GCTGGACGAC GCGCCTGTCC
AACCCGCTGC CGGTCGCGCC GACCTTCGGC GAGGAAGTTG CCGAGACCTT CGAGGCTGGC
GTCAAGTCGA CGCTGCTTGA CCGTCGCCTG CAGCTGAATG CAGCGGTGTT CACGACCAAG
TACAAGGGCA TCCAGCTCAA CTTCCAGCAG GGCGTTTCGC CGACCATCCA GAACGCAGGC
GACGCGCGGA TCAAGGGCTT CGAGATCGAG GCGGTTGCCG CCCCGGCCGA TGGCTTCACG
ATCACCGCTT CGGCGGGCTA CCTCGACGCC TACTACACCA ACGTGCTGGC GCCGGCGCAG
GTTGCGCCCA ACCCGTTCCA GCTCGGCGTG CAGAAGGGGT CGGCCCTGCC CAAGGCTCCG
GAGTGGAAGT TCAACGTCTC GCCGCGCTAT GAAGTGGCAG TGGGCAACGG CAAGATCGTA
GCCCTGGCCG ACTGGACCCA CACCACCGGC ATGCGCAACG ATACCGAAGG CACCATCCTG
CTGCTGCGTC CGACGACCGA CATCGTCAAC GCCAGCCTGC AGTACCAGGC TCCCGACAAC
CAGTGGAACC TGACGGTCGG CGGGACCAAC ATCACCAACG AACGCTATCT GGTGACCGGT
CAGGCCCAGA TCGCGGGCGG CCAGATCTAC GGCACCTACA GCCGTCCGGC CGAATGGTAC
GTCAGGCTCG GTTTCGAGTT CTGA
 
Protein sequence
MPGASSKNQQ GETNMRFASI SIAALATAIA APAFAQDQAQ ADDTRSGGIA EIVVTAQKRA 
ENVQDVPIAI TAFTAGALQE RAVGDVSALS GITPNVTLDA STPFSGSSAV LGATIRGIGS
SDFAFNIDPA VGVYLDGVYL GRSIGANQDL LDVERIEVLK GPQGTLFGRN TIGGAISIVT
HNPGDEFHAK GDVTVGRFNR IQARGLVDIP LAPGLSSSVA FGLHKRDGFQ KRVAYSDPGA
NDSFTLFPAS GYETRSRQGG DNSWNLRGKL RWDDGGKFRA TFSADYTNID QDSTANTVLA
VTPIPGPFAG VAENNIPGTA LDVVTGSSGF LFAGLYNFCI GATAQQIADR NATNLCGPRS
SVNGYLTLPG LASRNVDGDP YNDLLPYDGR WVNTDKDVSY ANGNNFSKLK QWGLGLNLEY
DLTDNIALKS ITSYREVDFK AGVDLDNSPL PILQTSFIVD QYQFSQEVQL TGSAMDGALN
FVLGGYGFKE NGDLRDFVTF SAGLLQVDGP GKVDTEAYAG FGQVDWRVND LIGISVGARY
TKENKRYDGA QSDINGFNYK LFNCMALDPA TGNPSAECAA GVGFPIPSEP FRYYPTSPNK
QTFDDFSYKL GLQLHPTEDV MAYGSFSRGY KTGGWTTRLS NPLPVAPTFG EEVAETFEAG
VKSTLLDRRL QLNAAVFTTK YKGIQLNFQQ GVSPTIQNAG DARIKGFEIE AVAAPADGFT
ITASAGYLDA YYTNVLAPAQ VAPNPFQLGV QKGSALPKAP EWKFNVSPRY EVAVGNGKIV
ALADWTHTTG MRNDTEGTIL LLRPTTDIVN ASLQYQAPDN QWNLTVGGTN ITNERYLVTG
QAQIAGGQIY GTYSRPAEWY VRLGFEF