Gene Saro_3084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3084 
Symbol 
ID3916699 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp3302873 
End bp3305065 
Gene Length2193 bp 
Protein Length730 aa 
Translation table11 
GC content65% 
IMG OID640445867 
ProductTonB-dependent receptor 
Protein accessionYP_498353 
Protein GI87201096 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAGGTC ACGTTTTCCT CAAGACTGCG AGCCTTTTCG CGCTGATGCT GTCGGCCGCG 
CCCGCCCTTG CCGCGCCCGC CCTTGCCGCG CCCGCCCTTG CCGCGGACGA ACAGGACCAG
CCCTCCGACG GCGGTCTTGG CGAAATCGTC GTGACCGCGC AGAAGCGTGC CGAACCGTTG
CAGAAGACGC CGATCTCGAT CGTGGCGCTG ACGGCGGACG ACATCGCGAA GAAGGGCATT
GCCGACCTTA CCGACCTGCG CTCGCAGGTG CCCGCGCTGC AGGTAACGCC GCATCCGAAC
AGCGCGACGA CAGCGCGTGT CTTCCTGCGC GGCGTCGGCA ACAACGACGA CCAGATCACC
GTCGATCCCA GCGTCGCGAT CTATCTCGAT GGCATCTACG TGGCGCGCGG GCAGGGGCTT
GCGGCCGAGA TCGCGGAGAT CGAGCGCATC GAGGTGCTGC GCGGCCCGCA GGGGTCGCTC
TACGGTCGCA ACGCCACGGG CGGTGCGATC AACTACATCG CACGGCAGCC CCGGCTTGGC
GAGTTCCACG CGCGCCAGTC GCTGGCCTAC GGCAACTACG ACCAGTTCCG TTCGCGCACG
AGCGTCAACG TTCCGGTCGG CGAAACCCTT GCCGTGGAAC TGGCCTATCT GCACAGCAGC
AAGGACGGCT TCGTCCGCAA CCTTGGTACC GGTGTGGAGC GTTTCGGGGA TCAGCGCCGC
GATGCCTATC GCGCGGCCGT GCTGTGGCAG CCGGCGCCGT CCTTCGAGCT GCGCTATGCC
TATGACCGGT CGGACATTGC CGATACCCCC GCGTTCATGG TCTCGGCGCC GTACTACCCG
CGCATGGCGG TTCGACCGAC CGCAGGCTCT CCCGCCGTCC GCGACCTTGC GGCGAATGAC
GTGACCGCGC AGGGCCATAG CCTCGTCGCG AGCTGGAACG CATCGGACGA GGTCACCATC
CGCTCGCTGA CGGGCTATCG CAAGCTCGCC AACTTCACCA ACCAGAACTA CCTGACCGGC
GTTGCCGGGC CATTCCCTGT CTTCGTCACC ACTTTCGACC AGAACCAGCG TCAGTGGAGC
GAGGAACTTC AGGTCGTCGG CTCTGCGCTC GACCGCCAGC TGGAATACAC GTTGGGCGCC
TATTTCTTCG ACGAGAAGGC CTTCAGCTAT GACACCACCG TGCCGACGGG GCGGGCGACG
ACACTGCGCA CGGCAACCGT GCGCAACCGT GCCTGGGCAC TCTATGGGCA GATGACCTGG
CGGCCCGAAG CGCTTGCAGG GGTCTATCTG ACCGGCGGGC TGCGCTGGTC GCGCGACAGC
CGCAAGGCAA CGCTGGACCA GACGTCGGTC GCACTGAACG GCACCAGGAC GGTCCGCCCC
CAGGGGCGCG GCGACAACAG CTTCACCGAC GTAAGCCCCA GCGTGATCCT GGGATACGAC
GTCAACCGCG ACGTCAACGT TTATGCAAAA TGGTCGCGCG GCTACAAGAC CGGCGGCTAC
AACCTGCGGG CCAGCACGAT AGAGCGTTTT GCCGAAGGCT ATGGCCCGGA GCGGCTCGAT
TCATTCGAAT TCGGCCTCAA GTCCAGCTGG CTCGATAATC GTCTTCGCGC CAACGTCGCG
GTGTTCCGGG CGAACTACCG GGATATCCAG GTCAACATCC AGTCCGATCC GGAGAACCCC
GCCGTCACCG ACATCTTCAA CGCAGGCGAG GCGCGCATCC AGGGTATCGA ACTGGACCTC
ACGGCCAAGC CATCGCGGGC GCTTACGGTC AATGCCAACT ACGCCTTCCT CGATGCCGGC
TACCGCCGGA TCACCGATCA GATCACCGGC GCGAACATCG CCTCGCGCTT CAACTTCGTC
GAGGCGCCGC GTCACACCCT GACCGTAGGC GCGGAATGGA CCCTGCCGGA AACGCCGCTG
GGCGTCCCAT CGGCAAGCGT CGACTACTAC ATGCAGAGCC GCAAATTCTC CTCGACCACC
GATGCGCGCT ACATCGTTGG CGACTACGGC CTGCTCAATG CGCGGCTCAG CCTTTCGGAG
ATCCCGGTGG GCTTCGGCAA GTGGCGGCTT TCCGCCTATG CCCGCAACCT GACGGATACA
AAGTACTACA TCGCCAACTT TTCTGCCGGG CTGCCCGCCG CCTTCTTTGG TGAACCGCGC
ACGTATGGCA TCGAATTGAA CTTCGAATAT TGA
 
Protein sequence
MSGHVFLKTA SLFALMLSAA PALAAPALAA PALAADEQDQ PSDGGLGEIV VTAQKRAEPL 
QKTPISIVAL TADDIAKKGI ADLTDLRSQV PALQVTPHPN SATTARVFLR GVGNNDDQIT
VDPSVAIYLD GIYVARGQGL AAEIAEIERI EVLRGPQGSL YGRNATGGAI NYIARQPRLG
EFHARQSLAY GNYDQFRSRT SVNVPVGETL AVELAYLHSS KDGFVRNLGT GVERFGDQRR
DAYRAAVLWQ PAPSFELRYA YDRSDIADTP AFMVSAPYYP RMAVRPTAGS PAVRDLAAND
VTAQGHSLVA SWNASDEVTI RSLTGYRKLA NFTNQNYLTG VAGPFPVFVT TFDQNQRQWS
EELQVVGSAL DRQLEYTLGA YFFDEKAFSY DTTVPTGRAT TLRTATVRNR AWALYGQMTW
RPEALAGVYL TGGLRWSRDS RKATLDQTSV ALNGTRTVRP QGRGDNSFTD VSPSVILGYD
VNRDVNVYAK WSRGYKTGGY NLRASTIERF AEGYGPERLD SFEFGLKSSW LDNRLRANVA
VFRANYRDIQ VNIQSDPENP AVTDIFNAGE ARIQGIELDL TAKPSRALTV NANYAFLDAG
YRRITDQITG ANIASRFNFV EAPRHTLTVG AEWTLPETPL GVPSASVDYY MQSRKFSSTT
DARYIVGDYG LLNARLSLSE IPVGFGKWRL SAYARNLTDT KYYIANFSAG LPAAFFGEPR
TYGIELNFEY