Gene Saro_1662 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1662 
Symbol 
ID3918771 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp1741060 
End bp1743120 
Gene Length2061 bp 
Protein Length686 aa 
Translation table11 
GC content68% 
IMG OID640444403 
ProductTonB-dependent receptor 
Protein accessionYP_496936 
Protein GI87199679 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4771] Outer membrane receptor for ferrienterochelin and colicins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.917445 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGGCCAT TGGCCCGCCC CTTCTTCCTG ATCGCCACAC TCGCCCCCTG CGCTGCCCAC 
GCGCAGGATG CGGTCCAGCC CGAAATCGTG GTGATCGGCA CCGGCCTTCA GGCCCCGCCG
TCGGCACCGG CCTACAACGT CCAGCAGATC GATCGCGAAC GCCTTCTCGA AACCGCATCC
GGCCGACTCG AGGACGCCCT TTCCTCTGTC GCGGGTTTCC AGCAGTTCCG CCGCTCCGAC
AGCCGTTCGT CCAATCCTTC CGCGCAAGGC GTGACGCTGC GCTCGCTTGG CGGCAATGCG
ACCAGCCGCA CGCTCGTCCT GCTCGATGGC GTGCCGATGG CCGATCCATT CTTCGGCCAT
ATCCCTTTCA GTGCGATAGC CCCCGAACGG CTCGCGACTG CACGCGTCAC GCGCGGCGGC
GGCGCAGGCG CGTTTGGCGC GGGCGCGGTT GCCGGAACGG TGGAACTCGA AAGCGCGAAC
GCGGACCAGC TCGGCCTCGT CCAGGCTGGT GCGCTGGCGA ACGACCGTGG CGAGACGGAA
CTTTCCGGCG CCCTCGCCCC CCGCGTGGGC AAGGGGTTTG CGGTGATCTC CGGCCGCTGG
GACCGCGGTC AGGGCTTCTG GACCACGCCC GTCGGCCAGC GCGTGCCGAT CAGCGCCCGG
GCACGCTACG ACTCATGGTC GGCCGGCCTA CGCCTGGTCG CCCCCCTGTC CACCGACGTC
GAGTTGCAGA TGCGCGGGCT TTTGTTCGAC GACCGGCGCA CCTTGCGCTT CCGGGGGGCA
GACACATCCT CGAGCGGACA GGATGCTTCG TTGCGCCTCG TTGGGCGTGG CGCTTGGGCC
TTCGACGTGC TCGCCTATGT GCAGGCGCGC GACTTCACCA ATGTCGTCAT CAGTTCGACC
AGCTTCCGCA AGACGCTCGA CCAGCGAGCC ACGCCTTCGA CGGGCGTGGG TGGCAAGGCC
GAATTGCGCC CGCCCGTTGG CGGTAACCAC GTGCTCAGGC TCGGCGCGGA CTACCGCCTC
AACGATGGCG ACATGGCCGA GGACGCATAT TCGGCGGCTA CTGGCCTGGT TACCGCCCGT
CGCCGGGCAG GTGGCAAGAC CAGCGACCTC GGCCTGTTCC TCGAAGACGA CTGGACGCTC
GGCCGCCTCG TCCTCAACGC CGGCGCCCGC GCCGACCGCT GGACAATCCG CGACGGCTAT
TTCCTTGAGC GGAGCCCCTC GGGCGCCACG ACCATCGATT CCGCGCTGGA CCCCGCCTTT
GCCGACAGGT CCGGCTGGCA GGCAAGCTTC CGAGGCGGCG CCGTGTTCAG GGCCACCGAC
ATGCTCTCGC TTCGCGCTGC CGGGTACACC GGATTGCGCT TGCCGACGCT CAACGAATTG
TACCGCTCGT TCAGCGTCGT CGCGCCGCGC AGCGAGGGTG GCATCGCGAT CACCGCAACG
CAGCGCAATC CGCTTCTGCA CAACGAGAAG CTCGAGGGCT TCGAAGCGGG GCTGGACTTC
ACCCCCGCCC CCGGCCTGGC GTTCAACGCG ACTGCCTTCG ACAACCGCAT CCGCAACGCC
ATCGCCAATG TCACCCTGGG CACGAGCGGC AACACGACCA CGCGCAAGCG TCAGAACGTC
GATGCCGTCC ATGCGCGCGG TCTTGAATTC GGCGCCAGGC TGCGGGCGGG CGCGATCTCG
CTCGATGGTT CGCTGGCATG GACCGACGCC GAGGTGGAGG CATCGGGCAT TTCCGCCTCG
CTCGACGGCA AGCGTCCCGC GCAGACGCCG CGTTGGGCGG GCACGGCCAC CCTTGCCTGG
CGGCCCGCCG AACGCTGGTC GCTGGGCCTC ACCCTGCGGC ACGTTGGCGC GCAGTTCGAG
GACGATCTGG AAACCGACCT CCTCCCGTCG GCCACAACGC TCGACGGCTT TGCCCAGCTT
CCGCTCCATG GCCCGATCAG CCTTGTCCTG CGCGGCGAGA ACCTTACTGG CGAGACGATA
GTGACGCGGC TGCAGGACGG TTCTATGGAT ATCGGCACGC CGCGCACGTT CTGGGCGGGT
ATTCGTGTGG AGGTCCGTTG A
 
Protein sequence
MRPLARPFFL IATLAPCAAH AQDAVQPEIV VIGTGLQAPP SAPAYNVQQI DRERLLETAS 
GRLEDALSSV AGFQQFRRSD SRSSNPSAQG VTLRSLGGNA TSRTLVLLDG VPMADPFFGH
IPFSAIAPER LATARVTRGG GAGAFGAGAV AGTVELESAN ADQLGLVQAG ALANDRGETE
LSGALAPRVG KGFAVISGRW DRGQGFWTTP VGQRVPISAR ARYDSWSAGL RLVAPLSTDV
ELQMRGLLFD DRRTLRFRGA DTSSSGQDAS LRLVGRGAWA FDVLAYVQAR DFTNVVISST
SFRKTLDQRA TPSTGVGGKA ELRPPVGGNH VLRLGADYRL NDGDMAEDAY SAATGLVTAR
RRAGGKTSDL GLFLEDDWTL GRLVLNAGAR ADRWTIRDGY FLERSPSGAT TIDSALDPAF
ADRSGWQASF RGGAVFRATD MLSLRAAGYT GLRLPTLNEL YRSFSVVAPR SEGGIAITAT
QRNPLLHNEK LEGFEAGLDF TPAPGLAFNA TAFDNRIRNA IANVTLGTSG NTTTRKRQNV
DAVHARGLEF GARLRAGAIS LDGSLAWTDA EVEASGISAS LDGKRPAQTP RWAGTATLAW
RPAERWSLGL TLRHVGAQFE DDLETDLLPS ATTLDGFAQL PLHGPISLVL RGENLTGETI
VTRLQDGSMD IGTPRTFWAG IRVEVR