Gene Saro_1559 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1559 
Symbol 
ID3917234 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1615327 
End bp1617519 
Gene Length2193 bp 
Protein Length730 aa 
Translation table11 
GC content63% 
IMG OID640444299 
ProductTonB-dependent receptor 
Protein accessionYP_496833 
Protein GI87199576 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.224366 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGATGA AGAGGGTAGC ACACCTGCGG CTCGTGGCCG TACTGGGAAC GAGCGCGCTG 
GCACTTCTCG CCGGTGGACA GGCTTACGCG CAGGAAGCAG TGGCACCGCA GGAGCAGGCC
ACCGAGGCAT CGGTGTTCGG CGACATCGTC GTCACCGCCA CCAAGAAGGC GAACGCGCAG
AACGTGCAGG ACGTGCCGAT TGCCGTCACC GCTTTCGGCT CCGAACAACT CGAAAGCCAG
CACGTGCGCA CGCTCGACAA CCTGGGCTAT AGCGCACCCA ACGTGCAGCT CGACGACGTC
GGCACCGCAC CAGGCTTTGC CAATTTCTCC ATCCGCGGCC TTGGCATCAA CAGCTCGATC
CCCTCTATCG ATCCAACCGT CGGCGTATTC GTCGACGGCG TCTACATGGG CATCAGCGCC
GGCATCCTGT TCGACACCTT CGACCTCGAA GGCGTCGAAG TGCTGCGCGG CCCGCAGGGC
CTGCTGTTCG GCCGCAACGT GACGGGCGGC GCGGTCGTGG TGCGCACATC CACCCCCGGC
AACGACCTCA AGATCGAAGG GCGCCTAGCT GCGGAAACCG GCCTCAACAA GATCGCCAGC
GCAGTGGTCT CCGGGCCGCT GATCAAGGAC AAGCTGGCCG CCAAGGTCGC GGTCTACTAC
AATGACGACG ACGGCTGGTT CACCAACAAG TTCAACGGCA ACAAGAACTT CGGCGCTTCC
AAGACGCTGA TCGTGCGCTC CGCCCTGCGC TACACGCCGA CTTCCGAGGT TGAGGCGGTG
GCCCGTTACG AACATGGACG CGTGCGCGGC GACGGCGCGG TAGTGAGCAA CTTCGGCCTC
TTCCGCCGCG AGAGCTTCGG TATCAGCGTC GACGAGGAAG GCGTTACCCG AAACGACTGG
AACCAGGCGT CGCTGGAACT GAACATCGAC ACCGATTTCG GCAACGGCAA GATCACCAAC
ATTGCCGCAT ACCGTGACTT CAAGGGCTTC GTGACGAGCG ATATCGACTC CTCGCCCAGC
TACACCTTCC ACGCAGACAC GCTCACCCGG CAAGACCAGT GGAGCAATGA ACTTCGCTAT
GCCGGCACGT TCGGTGCGCT GGAGCTGACC ACCGGGCTCT ACTACTTCCA GCAGGATATC
GACTACATCG AACTGCGCCG TCTTGCGGCG GGAGCGCTCA AGATCTCCGG CGGCGGCAAG
CAGCACCAGA AGACGTTCGG CGCCTTCGTT TCGACCGACT GGCACGTCAC CGATACAGTG
ACGCTGAACG GCGGCGTCCG CTATTCGTGG GAGCGCAAGA GCGCCAAGGT TGCAAATCTC
GCCGGCAACC TGTGCGACCC GATCGTCACG AAGACCTGCA GCACCTATGG TTTCTCCAAC
AGCAAGAGCT GGAGCGATCC GACCTTCCGG GTGGGCGCGC AATGGCAGCC GACGAACGAG
ACCCAGGCCT ATGCATTCTT CGCCCGCGGT TTCCGCAGCG GCGGCTACAA CTTCCGCAAT
GGCAATGCCG CAGAAGCCCC GGGGCCGTTC GACGCCGAGA AGCAGAACTC GTTCGAGGCA
GGCATCAAGC AGGACTTCGG CCGCACGCTG CGCCTGAACC TTGCGGCATT CCACAACACC
GTGCTTGGCC TGCAGCGCGA GATCATCCGC CCGGTCCTGC CGATCGGCAC GACGCAGGTC
ATTCGCAATT CCGCGAACGT CCGCATCCAG GGCATCGAGG CGGAAGCCGT GCTGCGCGTT
GGCGACCACC TCACCTTCAA CGGCCAGTTC GGCTATACCA AGGCCAAATA CACGAAGATC
CTCTACGACC TGACGGGCGA CGGCGCGATC AATGCCAAGG ACTTTGCCCT CAAGCCGCCG
CGCCTGGCAC CCTGGACCTA TGGGGTGAGC GCCAATTTCG CGCATGAAGT GACAGGTGGC
GGCGAAGTGA CGGCGCGCCT CGGCTATGCC CATCGCGATG CGGCCTGGTC GAACGATGCC
AACACCGGCC TGCTGAGCAA GGCAGACATG GTCGACGCGA ACCTCTCCGT AGAGACGGCC
GGTCGCAGGT GGAAGTTCTC GGTCTACGGC ACCAACCTGC TGAACGACCA GACCGAGGGC
AACGTCTCGA GCCTCCCGTT CTTTGCCGGA TCGACCTTCG CGTCGATCAA CAAGGGACGT
GTCGTCGGGG CCGAAGTTCT GTTCCGCTAC TGA
 
Protein sequence
MGMKRVAHLR LVAVLGTSAL ALLAGGQAYA QEAVAPQEQA TEASVFGDIV VTATKKANAQ 
NVQDVPIAVT AFGSEQLESQ HVRTLDNLGY SAPNVQLDDV GTAPGFANFS IRGLGINSSI
PSIDPTVGVF VDGVYMGISA GILFDTFDLE GVEVLRGPQG LLFGRNVTGG AVVVRTSTPG
NDLKIEGRLA AETGLNKIAS AVVSGPLIKD KLAAKVAVYY NDDDGWFTNK FNGNKNFGAS
KTLIVRSALR YTPTSEVEAV ARYEHGRVRG DGAVVSNFGL FRRESFGISV DEEGVTRNDW
NQASLELNID TDFGNGKITN IAAYRDFKGF VTSDIDSSPS YTFHADTLTR QDQWSNELRY
AGTFGALELT TGLYYFQQDI DYIELRRLAA GALKISGGGK QHQKTFGAFV STDWHVTDTV
TLNGGVRYSW ERKSAKVANL AGNLCDPIVT KTCSTYGFSN SKSWSDPTFR VGAQWQPTNE
TQAYAFFARG FRSGGYNFRN GNAAEAPGPF DAEKQNSFEA GIKQDFGRTL RLNLAAFHNT
VLGLQREIIR PVLPIGTTQV IRNSANVRIQ GIEAEAVLRV GDHLTFNGQF GYTKAKYTKI
LYDLTGDGAI NAKDFALKPP RLAPWTYGVS ANFAHEVTGG GEVTARLGYA HRDAAWSNDA
NTGLLSKADM VDANLSVETA GRRWKFSVYG TNLLNDQTEG NVSSLPFFAG STFASINKGR
VVGAEVLFRY