Gene Saro_1603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1603 
Symbol 
ID3918711 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1667939 
End bp1670911 
Gene Length2973 bp 
Protein Length990 aa 
Translation table11 
GC content63% 
IMG OID640444343 
ProductTonB-dependent receptor 
Protein accessionYP_496877 
Protein GI87199620 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4771] Outer membrane receptor for ferrienterochelin and colicins 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGTT CAGCAACGCG ATCCATTGCG TTCCGCGCAT CCAGCATCGG CGCGATTGCG 
CTGGCGCTCG CCGGGCAATC CGCGCTGGCA CAGGACGTCG CTGCCGGCGA TGCCGAGCAG
TCGGGTGAAA TCGTCGTCAC CGGCATCCGC GCCTCGCTCA GCAAGGCTCT CGACATCAAG
CGCACGGCGC AGGGCGTGGT CGATGCGATC TCTGCCGAGG ACATCGGCAA GTTTCCCGAT
ACGAACCTTG CTGAATCGCT CCAGCGCATC ACCGGCGTGT CGATTGACCG TTCGAACGGT
GAAGGCTCGA CCGTCACGGT CCGCGGCTTC GGTCCGGAAT TCAACCTCGT CCTCCTGAAC
GGCCGCCAGA TGCCGACCTC GTCGCTGGGC GACGGGGCCA GCGCGCCGTC CTCTCGCTCG
TTCGACTTCG CCAACCTTGC CTCGGAAGGC ATCTCGGGCG TGGAAGTCTA CAAGTCGGGC
CGCGCCACGC TGCCCACCGG CGGCATCGGC TCCACCATCA ACATCAAGAC CCCGCGTCCG
CTAGACCGTC CGGGCCTGTC GGGCAGCCTC GCGGTTCGTG GCGTTTATGA CAGCTCGCGC
AACGAAGGCA ATCCGATCAC GCCGGAGGTT TCCGGCATCG TCTCGGATAC GTTCGGCGAT
GGCGTGTTCG GCATTCTCGT CACCGGCACC TGGCAGAAGC GCAAGGCCAG CGTGAACCAG
GCGAACGTCG GCTGGCGCGA CGGCTATCTC GGTTCGGAAA ACAACTGGGG TTCGCTGCCG
CAGGAAGGCG ATCCGCGTTA CGGCAGCATC ACCAACCGTC CCGGCCCGAA CGACGTCTAT
CAGGTCCAGC AGAACGCCAG CTACGATCTC AACGACATCG ACCGCGAGCG GCTGAACGGT
CAGGTCGTGC TGCAAGCTCG TCCGACCGAC AGCCTGACCG CGACGATCGA CTACACTTAT
TCGCGCAACA CCGTGCAGGT GCGCAATTCG AACGTCGGCG TGTGGTTCAA CTTCAACGAC
GTTTCCAGCG CCTGGACGGA CGGTCCGGTT GCTGGCCCGA TCTTCTATTC GGAAAAGTTC
GGGGCGGGCG AAGGCAAGGA CTTGTCCTAT TCGGGCTCGC TCACCGAGAA CCGCTCGGAA
AACAAGTCCA TCGGCGGTAA TCTTCAGTGG AAGGGGCCGG GCGGCCTGCG CCTCGAACTC
GATGGTCATC ATTCGACGGC CGAATCCGGT GCCAACAATC CTTATGGCAC CAGCACTTCG
GTCGGTACGG CGGTCTTCGG AATCAAGCAG CAGACGGTCA ACTACGAGAA CGACCTCCCG
GTCATTTCGG TCGTCATGCA TGACGGCATC GACCCGCTCA ACGCCGCGAA CATCCAGGCC
ACCGGCAATG CGTTCCGCAA CGCCTATTTC AAGGACACGA TCAACGAGGT CCAGTTCCGC
GGCGGTTACG ACTTCGACAA CTCGATCCTC GACAGCCTCG ATTTCGGCGT GACCTATGTC
GAGAACAAGG TGCGTTCGGC CTATGGCTTC ATCCAGAACG ATACCTGGGG CGGGTCGACC
ACGAAGGAGC AGCTCCCGGA CGACCTGTTC ACGCTCGAAT CCCTGCCTGA CAAGTTCAAG
GGTGTCTCGG GTGCCAGCGA CCCGGCGATG ATCCAGAGCT TCTACCGCTT CAACTTCGAG
AAGATGGTCG GCTTCCTCGA CGACCTGAAC GGCATCTGCG GCGGCGATGG CGATTGTCGC
GCGCCATTCA CGGTCGACCG GCGCATCCGC GAACGGACGC TGGCGCCCTA TGCTCAGGCG
AACCTGACCT TCGACCTGCT CGAAAACCCT GCGCATTTCC GCGCCGGCAT TCGCTACGAA
AAGACGAAGA TCACCTCTTC GGCGCTCGTG CCGATCCCGA CCGGCACGCA GTGGGTGGCC
GCGAACGAAT TCAACCTCAC CTACGGCAGC GGATCGGACT TCACCACGTT CAAGGGTGAA
TACGAGAACT GGCTGCCCGC GTTCGACTTC GACTTCGAGC CGATCGAGAA CGTCAAGGTC
CGCGCGAGCT ACAGCCACAC CATCACCCGG CCCGACTATG CCTCGATGCA GGGCGGCCGT
ACGGTGGACC AGCTCTTCCG CATCGGTGGC GGCTTCGGCA GCCAGGGCAA CCCGGGCCTG
CTTCCCTTCA AGTCGAAGAA CATCGACGTG TCGGCGGAGT GGTACTACGC CCCGGCCAGC
TACCTGTCGG TCGGTTTCTT CGACAAGCGG GTGAGGAACT TCATCTCGAG TACGCGGGTT
GACACCGAGG CGTTCGGGCT GACCAATCCG GCCGATGGCC CGCGTTACCA GGCCGCCGTG
GCCGCACTTG GCCCCAACGC CAGCACGACC GCGATCCGCA ACTACATCTT CGCCAACTAC
CCGTCTTCGG TGGTCGTCGA CAGCTACGAC CCGGTCACCG GAAACTACAC CGGCAAGATC
CTCGGTCTGC CCGAAGACAA CAAGGTGAAC TTCCAGATCA CCACGCCGAT CAACTCGGAC
CAGGCGGCAC ACCTCTATGG TTTCGAGTTC GCCGTGCAGC ACAGCTTCTG GGATACCGGC
TTCGGCGCGA TCCTGAACTA CACCGTGGTC AAGGGCGATG CGAAGTACGA CAATTCCCAG
CCGTCCAGCG TGCCGCAGTT CGCGCTGACC GGCCTTTCGG ACAGTGCCAA CGCCGTCCTG
TTCTATGACA AGAACGGGTT GCAGGCGCGC GTCGCCTACA ACTGGCGCGA CAAGTTCCTC
GCCGGCACGG GCCCCAACCC GTACTATGTC GAGGCCTATG GCCAGGTCGA CGCAAGCGCG
AGCTATGAGT TCCGCAAGGG ATACACCGTG TTCGTCGAGG CGATCAACCT TACCGGCTCC
AGCCGACGGG GGCACCTGCG CAGCACCAAC AACGTGTTCT TTTCGTCGCC GGGCTATGCC
CGCTACCAGG CCGGTTTCCG CTTCAATTTC TGA
 
Protein sequence
MKSSATRSIA FRASSIGAIA LALAGQSALA QDVAAGDAEQ SGEIVVTGIR ASLSKALDIK 
RTAQGVVDAI SAEDIGKFPD TNLAESLQRI TGVSIDRSNG EGSTVTVRGF GPEFNLVLLN
GRQMPTSSLG DGASAPSSRS FDFANLASEG ISGVEVYKSG RATLPTGGIG STINIKTPRP
LDRPGLSGSL AVRGVYDSSR NEGNPITPEV SGIVSDTFGD GVFGILVTGT WQKRKASVNQ
ANVGWRDGYL GSENNWGSLP QEGDPRYGSI TNRPGPNDVY QVQQNASYDL NDIDRERLNG
QVVLQARPTD SLTATIDYTY SRNTVQVRNS NVGVWFNFND VSSAWTDGPV AGPIFYSEKF
GAGEGKDLSY SGSLTENRSE NKSIGGNLQW KGPGGLRLEL DGHHSTAESG ANNPYGTSTS
VGTAVFGIKQ QTVNYENDLP VISVVMHDGI DPLNAANIQA TGNAFRNAYF KDTINEVQFR
GGYDFDNSIL DSLDFGVTYV ENKVRSAYGF IQNDTWGGST TKEQLPDDLF TLESLPDKFK
GVSGASDPAM IQSFYRFNFE KMVGFLDDLN GICGGDGDCR APFTVDRRIR ERTLAPYAQA
NLTFDLLENP AHFRAGIRYE KTKITSSALV PIPTGTQWVA ANEFNLTYGS GSDFTTFKGE
YENWLPAFDF DFEPIENVKV RASYSHTITR PDYASMQGGR TVDQLFRIGG GFGSQGNPGL
LPFKSKNIDV SAEWYYAPAS YLSVGFFDKR VRNFISSTRV DTEAFGLTNP ADGPRYQAAV
AALGPNASTT AIRNYIFANY PSSVVVDSYD PVTGNYTGKI LGLPEDNKVN FQITTPINSD
QAAHLYGFEF AVQHSFWDTG FGAILNYTVV KGDAKYDNSQ PSSVPQFALT GLSDSANAVL
FYDKNGLQAR VAYNWRDKFL AGTGPNPYYV EAYGQVDASA SYEFRKGYTV FVEAINLTGS
SRRGHLRSTN NVFFSSPGYA RYQAGFRFNF