Gene Saro_3530 details

Gene Information

Locus tag: Saro_3530
Symbol:
ID: 5077679
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_009427
Strand:
Start bp: 145800
End bp: 148157
Gene Length: 2358 bp
Protein Length: 785 aa
Translation table: 11
GC content: 64%
IMG OID: 640481254
Product: TonB-dependent receptor
Protein accession: YP_001165916
Protein GI: 146275756
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1629] Outer membrane receptor proteins, mostly Fe transport
TIGRFAM ID:
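
The coordinate and length fields above agree with each other, and the check is simple arithmetic. A minimal sketch in Python, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. Both are assumptions about how this record reports its values, and the function names are illustrative only:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(145800, 148157)
print(length, protein_length_aa(length))  # 2358 785
```
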


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGTGT CACGTGCAAT CCAGTTGATG AGCGGTCTTT CCGCCATCGC GCTCGCGGCT 
GTTCCCGCCG GGGCAATGGC GCAGGCCGCG CCCGATCAGG CCCAGGCCGC TGATGCGCCC
ACCGGCGGCA TCGGCGAAAT CATCGTCACC GCGCAGAAGA AGGCGGAAAG CATCCAGACC
GTGCCGATCT CGATCGCGGC GGTCGGCGGC GAACAGCTTT CGGCAATGAA CGTCACCACG
CTCCAGGCAC TTCAGGGCTC GGTCCCGAAC GTCCAGATCG ACAACTTCGC CAATACGCCC
AACAACGCGG TCTTCACCAT TCGCGGCATC GGCGTGATCG AACCCGATCC CTATGCCGGC
AACACCGTGT CGATCGTGGT CGATGGCGTG CCGCAGTTCT TCTCGATGGG CGCGCTGCTC
GACACCTACG ACACCAACCG GGTCGAGATC CTGCGCGGTC CCCAGGGCAC CCTGTTCGGC
GCAAACACCA CCGGCGGCGT CGTCAACGTC GTGACCAACC AGCCGGACGG CAAGTTCGAC
GGCTACGTCA AGGGCACCTA CGGCAACTGG AACCGCTTCG ACATCAGCGC CGCCGTGGAA
GCGCCGCTGG TCGAGGACAC CCTCAGCCTC AAGGTCTCGG GCATCCACAC CCAGCGCGAT
GGCTGGACGA CCAACGTGTG GAACGGCGAA GACATGGGCC GCAAGAACGT CGATGCGGTG
CGCGGCCAGC TCTACATCAC GCCCAACGCC GATCTCAGGA TCACGCTCCA GGGTGAATAC
GTCGCCGCGC GCAACGGCGC GCCCATCGTC GTCAACGGCG GCCTGCCCGG CGAAGGCAAC
TACGTTCCCG AAGGCACGTT CTGGAACGGC GCCAAGCTGC CGATGTACCA GAGCCCCTGC
TCGGTCGAGG GCCAGCCCTG CAAGGCGCCC GACAAGTACT ACTCCGGCAA CAACGAAGTG
CCCGACCAGT CGGACATGAC GACGAAGTTC TTCGTCGGCA CGATCCAGTA CGACAACACC
CCGCTGGGCG ACATCACCGC GATCACCGGC TACAAGCGCT TCACGCTGTT CGAATACACC
GACCAGGACG GCACCGCGAA GACCAACAAC GCAACGCGCC GCCGCACCCG CGGCTGGCAG
TTCAGCCAGG AACTGCGCAG CGCCTTCGAG GCGGGAGACA ATTTCAACGC CGTCGCGGGC
CTGTTCTACC TGAAGACGCA CTACAACCAC TACCAGATGT ACCACCTGGA CTTCGCCCTT
CCCGGCCTCG TCCAGTACAA CGAACAGGAC CAGGGCACGG AATCGTTCTC GGCCTTCCTG
CAGACCTACA CCCAGCTTAC CGACCAGCTG AAGCTGTCGG CCGGCGTGCG CTACACGCAT
GACAGCGTGA ACGCCCGCTC CACGCTCGAC TACGGCGTCG GCGCGCCCGC GCTCACCGAT
CCGAACTGGG CCATCATCCC GACCATCGTC GTCGATGGCG AGACTCTCCA GGTCGGCCGC
GACCTTCGCA CCGGCCCGCA CGACATCGAC GTCGGCGGCA AGAAGAGCTG GGACAACGTC
GGCTGGAAGC TCGGCCTCGA TTACGAGATC GGCCAGAACC AGATGGTCTA TGCAAGCTGG
GCGCGCGGCT TCAAGTCGGG CGGCTTCACC GGCCGCATCG GCACCGCGTC GGACGGCGAC
ACGCCCTACG GCCCGGAAAA GGTCGATACC TTCGAGGTGG GCCTCAAGGC CGACTTCCTC
GACCGCCACG TGCGCACCAA CCTCGCGGTG TTCTACACCA ATTACCGCGA CATGCAGGTC
GCCCAGATCT ATTTCGATCC CGATACCAAC ACTCAGGGCA ACCGCATCCT CAACGCCGCC
AAGTCCGAGA TCAAGGGCTT CGAACTTGAA GTCCAGGCGA TCCCGTTCGA AGGCCTCACC
CTGCGCGGCT CGCTCGCCTA CCTCGACACG AAGTACAAGA GCTTCCTCTA CTTCGATCCG
GTCGCGGAAG AGTACCTCAA CCTCAAGGGC TACGCGCTCC AGAACGCGCC GAAGTGGGCT
TCGACGCTTG GCCTCAACTA CACCAAGACG ATGGACAACG GGAACTCGAT CGTTGCCGAT
GTAAGCTGGA TGTACACCGG GCAGAAGTTC TACACTGCCG TGGTCAACAC CCCGCGCTCG
TCGATCCAGC CGACCTACTA CGTCGACGGC ATGCTGACCT GGTACGGCCC GGACAAGCGC
TACTCGATCG GCCTGTGGGG CAAGAACCTG TTCGACAAGC GCTATATCTC CACCGTCTAC
GACAGCCCCG GCTACATGGG CCTCGTCGGC TACGCACCGC CGCGCCAGTT CGGCGTTTCG
GTCGGCTACA ACTTCTGA
 
Protein sequence
MKVSRAIQLM SGLSAIALAA VPAGAMAQAA PDQAQAADAP TGGIGEIIVT AQKKAESIQT 
VPISIAAVGG EQLSAMNVTT LQALQGSVPN VQIDNFANTP NNAVFTIRGI GVIEPDPYAG
NTVSIVVDGV PQFFSMGALL DTYDTNRVEI LRGPQGTLFG ANTTGGVVNV VTNQPDGKFD
GYVKGTYGNW NRFDISAAVE APLVEDTLSL KVSGIHTQRD GWTTNVWNGE DMGRKNVDAV
RGQLYITPNA DLRITLQGEY VAARNGAPIV VNGGLPGEGN YVPEGTFWNG AKLPMYQSPC
SVEGQPCKAP DKYYSGNNEV PDQSDMTTKF FVGTIQYDNT PLGDITAITG YKRFTLFEYT
DQDGTAKTNN ATRRRTRGWQ FSQELRSAFE AGDNFNAVAG LFYLKTHYNH YQMYHLDFAL
PGLVQYNEQD QGTESFSAFL QTYTQLTDQL KLSAGVRYTH DSVNARSTLD YGVGAPALTD
PNWAIIPTIV VDGETLQVGR DLRTGPHDID VGGKKSWDNV GWKLGLDYEI GQNQMVYASW
ARGFKSGGFT GRIGTASDGD TPYGPEKVDT FEVGLKADFL DRHVRTNLAV FYTNYRDMQV
AQIYFDPDTN TQGNRILNAA KSEIKGFELE VQAIPFEGLT LRGSLAYLDT KYKSFLYFDP
VAEEYLNLKG YALQNAPKWA STLGLNYTKT MDNGNSIVAD VSWMYTGQKF YTAVVNTPRS
SIQPTYYVDG MLTWYGPDKR YSIGLWGKNL FDKRYISTVY DSPGYMGLVG YAPPRQFGVS
VGYNF
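
Both sequences above are printed in space-separated groups of 10 characters, so they need to be joined before any length or composition check. A minimal sketch in Python, using only the standard library. The helper names `clean_sequence` and `gc_percent` are hypothetical and not part of any tool associated with this record:

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# The first line of the gene sequence above, used here as a small example.
first_line = "ATGAAAGTGT CACGTGCAAT CCAGTTGATG AGCGGTCTTT CCGCCATCGC GCTCGCGGCT"
dna = clean_sequence(first_line)
print(len(dna), dna[:3])  # 60 ATG
```

Applied to the full gene sequence, the cleaned string should be 2358 characters long, and the result of `gc_percent` can be compared with the GC content reported in the Gene Information section.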