Gene Saro_0345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_0345 
Symbol 
ID3918229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp369514 
End bp371550 
Gene Length2037 bp 
Protein Length678 aa 
Translation table11 
GC content59% 
IMG OID640443074 
ProductTonB-dependent receptor 
Protein accessionYP_495627 
Protein GI87198370 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1629] Outer membrane receptor proteins, mostly Fe transport 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGATCAAGA CCATTACTAC CAGCAGCGCC CTTGCGCTGA TCTGTATCGG AAAACCGGCA 
TTTGCCGAAA CCTCTGAAAA GGCATCGCAA ACAAGCGCGG ATGCAGTCGT GGCGGGTAAC
GACATTGTGG TTTTCGGTCG CGGTGAGGCC ATGATCGGCA CTGCCAAATC AGCGAGCGAA
GGAAGCGTCG GAGGGGCCGA TCTCCTCGTG CGGCCTCTTC TCCGCGTGGC CGAACTGCTG
GAAGCGGTGC CCGGGCTGGT TGCAGCCCAG CATTCTGGGA GCGGCAAAGC CAACCAGTAT
TTCCTGCGCG GTTTCAACCT CGACCATGGG TCTGACTTCA GTACCTATAT CGATGATGTG
CAGATGAACT TTCGTACCCA CGGACATGGT CATGGCTATC TCGACCTCAA CGGCCTCATT
CCGGAGATCG TGGGCCGGGA GGATTTTCGT AAAGGGCCCT ACCGCGCCGA TGGCGGTGAT
TTCGCTCTGG CGGGAGCAGC CTATATGACG ACCATCAAGG GTTATGACCG GCCATGGGCA
TCGGCCGAGA CTGGTTCATA TGGTTGGCGC CGCGTCGCTG CGGGTGGAAC ATTGCACGAC
CTGGGCGGCG GAGACCTCAC GCTTGTCGCC CAGGCCAAGG CCTATGACGG ACCGTGGCAG
GAACCTGAAC GTCTGCGCCA TTACTCGGGG TTCGCGAAAT ATAGCATGCC GACCGGTGCG
GGCACATTGG AGGCATCTCT CCATGCCTAC CGGGCGACAT GGCACCCAAC CGAGCAAATC
CCCGAGCGCA TTATCGGCAC GGCGTTGTGT GCGGATGTGT TCTGCTCTCC AGATCCTTCC
GCGCGGGGTG AGACGACGCG CCTGGTGGCT AACATCGCGG TCAAGCAACC GACATGGCGC
GCCAATGTCT ATGCCCAGTT TTACGACTGG TCGATGTTCT CGAACCCCAC TTACACCGAT
CCGGATGGCA CAAGCGCGCA GATCAAGCAG TTCGACCGGC GTTGGGTCCT CGGGCTGTCC
GCACAAAAAC ATTGGGAAAT CGCTGACAGT CTGGCTGTGA GCCTTGGCAC CGAAAACAGA
TACGACGCCG TCGGGAATGT TGGTGTCGAT CGAACGGCTG CCCGCGCATT TCTTGAATCT
CTCGGGCACT TTCGGGTCGG GGAATTGTCT TCCGCGCTCT ACGGCGAAGT CGCTTGGAAA
CCCTTGGCGG GACTGCGTGT GACAGGTGGT CTTCGCGGGG ACTATTATCA CTATTCCGTG
CGTGCACGAG ATTCTGTTGC GGCGTCGCTG GGCGAAGGCA GTGGCTCAGC GTCGATTCTC
TCTCCCAAGG CGTCAATCGC CTATCAGGTT ACGCCGCATC TTGAACTCTA CGCCAACTGG
GGCCGTGGAT TCCATTCCAA CGATGTTCGG GGTGCGGTCA ACAGGGACAC GCCTGTTCCC
GTTCTGGTTC GCGGCATCGG CAAGGAACTG GGAGGACGCA TTCAATTCTC CGGGGTCACG
TTAACCGCGA CTTACTGGTG GCTGCATGTC GGCAGCGAAC TTCGTTTCAT TGGCGATTCC
AATGCTGTTG AACCGTCGGG TGCCAGTGGG CGTCATGGCT ATGAAATCGT CGCCTTCTGG
CGGCCGTTCC CTTGGCTTGC GCTTGATGGA AACTATACCG CCAGCCATGC GCGCTTCGAC
AATGGCGATC ACATCCCCAA TGCATTTGAG AACGCGGCTT CAGCCGGTGC CGCCATCATT
CTTGATCCCT GGGAAGCCAG CATTCGGGTG CGCCACCTTG GACCTTCTCC GCTTGTCGAG
GACAACAGTG TCCGGGATCG AGGCAGCACC GTCATGAATG CCCGGGCCGC GTGGAAGGGC
AAGAAGGTCG AGATATTTGG AGAAGTGCTG AACATCTTCG ACAGCCGGGA CAAGGACATC
GCCTATTATT ACGAGTCCTA CATCCCCGCC TTCGATGCAG GTGCTCCGGT GGAAGGCCGG
TTGAGCCGCG TGGTCGAGCC TCGAACTGTG AGGATTGGCG CAAAGGTCAA TTTCTAG
 
Protein sequence
MIKTITTSSA LALICIGKPA FAETSEKASQ TSADAVVAGN DIVVFGRGEA MIGTAKSASE 
GSVGGADLLV RPLLRVAELL EAVPGLVAAQ HSGSGKANQY FLRGFNLDHG SDFSTYIDDV
QMNFRTHGHG HGYLDLNGLI PEIVGREDFR KGPYRADGGD FALAGAAYMT TIKGYDRPWA
SAETGSYGWR RVAAGGTLHD LGGGDLTLVA QAKAYDGPWQ EPERLRHYSG FAKYSMPTGA
GTLEASLHAY RATWHPTEQI PERIIGTALC ADVFCSPDPS ARGETTRLVA NIAVKQPTWR
ANVYAQFYDW SMFSNPTYTD PDGTSAQIKQ FDRRWVLGLS AQKHWEIADS LAVSLGTENR
YDAVGNVGVD RTAARAFLES LGHFRVGELS SALYGEVAWK PLAGLRVTGG LRGDYYHYSV
RARDSVAASL GEGSGSASIL SPKASIAYQV TPHLELYANW GRGFHSNDVR GAVNRDTPVP
VLVRGIGKEL GGRIQFSGVT LTATYWWLHV GSELRFIGDS NAVEPSGASG RHGYEIVAFW
RPFPWLALDG NYTASHARFD NGDHIPNAFE NAASAGAAII LDPWEASIRV RHLGPSPLVE
DNSVRDRGST VMNARAAWKG KKVEIFGEVL NIFDSRDKDI AYYYESYIPA FDAGAPVEGR
LSRVVEPRTV RIGAKVNF