Gene Saro_2410 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_2410 
Symbol 
ID3916729 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2578324 
End bp2581104 
Gene Length2781 bp 
Protein Length926 aa 
Translation table11 
GC content63% 
IMG OID640445165 
ProductTonB-dependent receptor 
Protein accessionYP_497680 
Protein GI87200423 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID[TIGR01782] TonB-dependent receptor 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGGAC CCAACACATC CATTCGGGAC CAAGGAGGGG GCCGGCCTTA CACCGGCGAA 
ATCTTTGGCG CGCACTGCGC CAAGCTGTTG GGGGAAATAC GCATGACACT GCGGCATCTT
CTGTTGATCA CCACGACTGC CATGGTGGCC GCGCCCGTCG CCGCCCAGGC ACAATCCGTT
TCAAGCAACG AGGCTGACGG CATCGTCGTC ACCGGCATTC GCGCCTCGCT GCGCGACGCC
ATCGAGATCA AGCGCAAGGC CAGCTCGATC GTCGACGTGA TCAGCGCCGA AGATGTCGGC
AAGTTCCCCG ACGCTAACGT CGCTGACTCG CTCGCCCGTC TTCCCGGCGT CACCGTCGAC
CGCCAGTTCG GCGAAGGCGA ACAGCTTTCG ATCGCAGGCG TCGAACCCGC CCTCAACCGC
CTGACCATCG ACGGCCACTC GGTCGCCTCG GCCGACTGGG GCGGCAACCC CTCCGACCGT
TCGAGCCGCT CGTTCAACTA CTCGCTGCTG TCGCCCTCGA TCATCAGCCA GGCCGTGGTC
TACAAGACCC CCGAAGCCCG CCTCCAGGAA GGCGCTATCG GCGCGTCCGT CGACGTCGTC
ACGCGCAAGC CGCTCGACCT CGACTCCAAC ACCTTCCGCT TCAACGGCGG CTACGAATAC
AACGACCGCG CCGATCGCGG CAGCATCCGC GGCAACGCGC TCTATAGCTG GAAGAACGCC
GACGAAACCA TCGGCATCCT TGCCGCCGCC AACTACGACA AGGAACAACT GAGCCGCGCT
GGCGTGGCGG TCTACTGGTA CCGCACCGGC CAGGCCCTGC TGGACAACGC CCCGTCGACC
GCCACCGTCA ACGGCAAGGC GATCACCGAC CTGACCGAAG ACGAGCGCAG CGAATTCGCC
AGCTCGCGCT TCGCCTCGTT CCTCGCGCGC GAATTCTTCA AGCAGGAACG CGAACGCATC
GGCTTCAACG GCGCGATCCA GGTCCGTCCG TCCGACAACC TGAAGCTGAC CGGCACGGCC
CTGATCATTC GCGGCAACTA CGACAACGTC TCGAACTCGA TGTACACGTA TGGGTACGAA
GGTTCGCGCC TCATCTCGGC CCAGTTCAGC GGCGGTCTCG TGACCCAGGC GACCTTCTCG
GGCATCGCGG ACGGCCAGAC CGGCTCGACC GGCCAGCTCG ACACGCTCTA TCGCCGCACC
CGCGTGAAGA ACGACACCTA TTCGCTTGCC TACGAATGGG AACCCGACGG CTGGCACGTG
ACCGGCAACG TCGGCTACAC CCGCGCTTCG GGCGGCAAGG ACCCCGAATA CCTGCTCGAC
TTCCGCACCC AGCAGGGCTT CACTTCGGGT GCTAACGGCC GCAACACGAC CGTCGACTGG
GACAGCCCGG CGACCGACCC GACCAAGTGG CTGAGCAATT ACACCGCAAA CGGCGGCGAG
AACATCACCG CCTCCGACGG CCGCACCTTC TTCGGTCGCC AGATCGGCGG CATCCCGACC
AAGTCCGGCT TCACGCTGGA CAAGGAATGG TTCACCGAAG CCAATGCCGT GCATGACCTG
AACGCCGGGC CGTTCACCCA GCTGCTGTTC GGCGCGCGCT TCACTTCGCA CGAAAACAGC
AACACGACCT ACAGCAACGC GATCTTCACC GACCAGGACT TCACCCTGGC GGATCTTGAC
TACAACGTGC TGCCTTCGGG TCTCTATGAT GGCCTTGGCA CCTCGGGCAA CGGTGCACCC
TATGTCGGCA TGGACAAGGA CGGCATCATC GCCGCCCTGG CCAAGTACGG CAATTTCACC
GACGACCGCG GCCTGGCGAA GGGCGAATAC TGGCTGGTGA AGGAAAAGAT CGCGGCCGGC
TATGCCCAGG CGAACTTCGA AACCGGCAAG CTGCGCGGCA ACGTCGGCGT CCGCTTCGTC
AGCACCAAGA CGGAATCGAA CTTCTACGCG CAGAGCGGTT CGACCGTTCA GCTCGTGCAG
TACAACAAGA CCGACAACCG GTTCCTGCCC TCGATCAACG TGATCTACGA CGCGGGCGAT
ACGGTTGTGC TGCGTGCAGC GGCGGCCAAG GTCATGTCGC GTCCGCGCTA CTCCGACCTT
GCCGGCTACC TTTCGTTGAC CGACTCCACC TTGAGCGGCA GCGGCGGCAA CCCCGACCTG
AAGCCCTACC TTGCGACCAA CTTCGGGTTC TCGGCCGAAT GGTACTTCGC ACCGGGCAGC
TTCCTTTCGG GCGAAGTGTT CTACCGCGAC ATCTCGAACT ACGTGGGCAA CGAGACGCTC
GAGACCGAAC TGACCAACCC GATCACCGGC AACACCCTCA CCTATGCGGT GTCGCGCCCG
GTAAACGGCG GCAAGGCATC GGTCACCGGC TTCTCGATCT CGGGCAACAC CAACCTCGCC
TGGGGCTTCG GCATCCAGGC CTCCTACACC TTCGCCGATG CCGAGACGAG CAAGGCGGAA
GGCCTGCCCT TCCTGTCGCG CAACACGATC CAGATCTCGC CCTACTACGA GAACGGACCG
TTCCAGGCGC GCGTCAGCTA CAACCGTCGT TCGAAGTACT TCTACCGCTT CGGCCGCCAG
CAGTCGCAGG ACTACACCGA CGCCTACCGC CAGCTTGACG CGCAGGTCTC CTACGCGATC
AACGAGAACC TGAGCGTGAC GGCGACGGCT TCGAACCTGC TGGACGAGAC GTACTACCAG
TACAGCTCGA CCAAGAACGC GCCGACTTCG ATCTACAAGA ATGGCCGCGT CTATTCGGCC
AGCATGACTG CGAAGTTCTA A
 
Protein sequence
MIGPNTSIRD QGGGRPYTGE IFGAHCAKLL GEIRMTLRHL LLITTTAMVA APVAAQAQSV 
SSNEADGIVV TGIRASLRDA IEIKRKASSI VDVISAEDVG KFPDANVADS LARLPGVTVD
RQFGEGEQLS IAGVEPALNR LTIDGHSVAS ADWGGNPSDR SSRSFNYSLL SPSIISQAVV
YKTPEARLQE GAIGASVDVV TRKPLDLDSN TFRFNGGYEY NDRADRGSIR GNALYSWKNA
DETIGILAAA NYDKEQLSRA GVAVYWYRTG QALLDNAPST ATVNGKAITD LTEDERSEFA
SSRFASFLAR EFFKQERERI GFNGAIQVRP SDNLKLTGTA LIIRGNYDNV SNSMYTYGYE
GSRLISAQFS GGLVTQATFS GIADGQTGST GQLDTLYRRT RVKNDTYSLA YEWEPDGWHV
TGNVGYTRAS GGKDPEYLLD FRTQQGFTSG ANGRNTTVDW DSPATDPTKW LSNYTANGGE
NITASDGRTF FGRQIGGIPT KSGFTLDKEW FTEANAVHDL NAGPFTQLLF GARFTSHENS
NTTYSNAIFT DQDFTLADLD YNVLPSGLYD GLGTSGNGAP YVGMDKDGII AALAKYGNFT
DDRGLAKGEY WLVKEKIAAG YAQANFETGK LRGNVGVRFV STKTESNFYA QSGSTVQLVQ
YNKTDNRFLP SINVIYDAGD TVVLRAAAAK VMSRPRYSDL AGYLSLTDST LSGSGGNPDL
KPYLATNFGF SAEWYFAPGS FLSGEVFYRD ISNYVGNETL ETELTNPITG NTLTYAVSRP
VNGGKASVTG FSISGNTNLA WGFGIQASYT FADAETSKAE GLPFLSRNTI QISPYYENGP
FQARVSYNRR SKYFYRFGRQ QSQDYTDAYR QLDAQVSYAI NENLSVTATA SNLLDETYYQ
YSSTKNAPTS IYKNGRVYSA SMTAKF