Gene Saro_1960 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_1960 
Symbol 
ID3917276 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_007794 
Strand
Start bp2077582 
End bp2079534 
Gene Length1953 bp 
Protein Length650 aa 
Translation table11 
GC content64% 
IMG OID640444708 
Productoligopeptide transporter OPT 
Protein accessionYP_497234 
Protein GI87199977 
COG category[S] Function unknown 
COG ID[COG1297] Predicted membrane protein 
TIGRFAM ID[TIGR00728] oligopeptide transporters, OPT superfamily
[TIGR00733] putative oligopeptide transporter, OPT family 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTACAG GTTCGGGACC CCGGGAACTG ACGATCAGGG GCATCGTCCT CGGCGCGATC 
CTTACTGTCG TATTCACTGC CGCTAACGTC TATCTCGGCC TCCGGATCGG CCTGACCTTT
GCGACGTCCA TACCAGCCGC GGTCATTTCC ATGGCCGTCC TGCGCGCGTT TTCCGGGGCA
ACCATCCAGG AAAACAATAT CGTCCAGACC ATTGCAAGTT CCGCAGGTAC GTTGTCGGCA
ATCGTCTTCG TGCTGCCCGG TCTTGTCATG GTGGGATGGT GGGCCGACTT TCCCTATTGG
GAGTCGGTCG CGGTCATTGC GGTCGGCGGC GTGCTTGGCG TGATGTATTC GGTGCCGCTG
CGCCGTGCTC TCGTTACGGG ATCGGACCTC CCCTATCCGG AAGGCGTCGC CGCGGCCGAA
GTGCTGAAGG TCGGCGCTGG CGTTGGCGGA GAAGAGGAAA ACCGCAAGGG CCTCGCCGCC
GTGACTGCGG GCGGCCTGCT TGCCGCGCTC TATCCGCTCC TCGCCAAGAT GAAACTCGCG
GCCGAGGAAG TTGGCGGGGT GTTCAAGGTC GGGACCGGAG GCTCGATGCT GTTCGGCGGC
CTGTCGCTCG CGCTCGTCGG AGTGGGCCAT CTGGTGGGCA TCGCGGTGGG CATTGCCATG
CTCGTCGGCA TCGTGATTAG CTTCGGCGTC CTTCTCCCCC AGTTCACCAC CGGGGGGCCG
CCGGTCGGGA CAGAACTGGC CGATTTTGTC GGCACGGTGT TTCGCCAGAA AATCCGTTTC
ATCGGCGCTG GCACCATTGG CGTTGCAGCC ATCTGGACGC TGCTGCGCGT AATCGGCCCC
ATCGTGCGCG GCATTGCATC GGCCATCGCA GCAAATCGCG CGCGGAAGGG CGCCGGCATC
GCCAGTCTCG ACCTGGCCGA ACGGGACATT CCAATTGGTA TCGTGGGCGG CACGATCCTG
CTTGCGATGG TGCCCATTGC ACTGCTTCTC GCCGACTTTG CGGTCGGAGG CCCCGTCGCC
GCCGCGCTCG GCATGACGCT TATCGCTTCG GTCGTCTACG TGCTGGTCGC TGGCATCGTG
ATTGCTTCTG TCTGTGGCTA CATGGCCGGT TTGATCGGCG CTTCGAACAG CCCTATCTCG
GGTGTTGGTA TCCTTGCCGC GCTCGGTATT TCCCTGCTTC TGCTCGCCCT GTTCGGGCAA
GCGACCAATC CCGACGATAC CAAGGCGCTA GTCGCATTTG CGCTATTCGT CACGGCAATC
GTCTTCGGCG TAGCCACGAT TTCGAACGAC AACCTTCAGG ATCTCAAGAC CGGTCAGCTC
GTCGGCGCGA CGCCATGGCG CCAACAGATA GCTCTGGTCC TCGGCGTCCT GTTCGGTGCG
TTGGTCATCC CGCCGATCCT CGACCTTCTC AATTCCACTT TCGGGTTCCA GGGCGCTCCG
GGGGCAGGGG AGAACGCGCT GTCGGCACCC CAGGCCGCAC TCATCTCCGC TATTGCGCAG
GGTGTTCTTG GCGGCAGTCT CGACTGGAAC CTCGTCGGCC TCGGCGCTGC GATCGGTGCC
GGCGTCATCC TCGTGGACGA ACTGCTAAAG CGCTCGGGCA ATCGGTCCCT TCCTCCACTC
GCGGTGGGAA TGGGCATGTA CCTGCCAACG CAGGTGACGA TGCTGGTCAT CGCAGGCACG
GTGCTGGGAC ACCTCTACAA TCGCTGGGCC CTGCGTCAGT CCTCGCCCGA GCTTGCGGAG
CGCATGGGCG TCCTGACGGC TACCGGTCTT ATCGTCGGTG ACAGCCTGTT CAACATCGCC
TACGCCGCCA TCGTCGCGGC GACCGACAAT CCCGATGCGC TGGCTGTAGT TGACGGCTTC
CCGGCAAGCT TGCCGCTGGG CGTGGCGGTT TTTGCCGCGA TCATCGCCTA TGCATACTGG
CGGATGCGGC GGGACGGAGG CAACCCCGCC TAG
 
Protein sequence
MTTGSGPREL TIRGIVLGAI LTVVFTAANV YLGLRIGLTF ATSIPAAVIS MAVLRAFSGA 
TIQENNIVQT IASSAGTLSA IVFVLPGLVM VGWWADFPYW ESVAVIAVGG VLGVMYSVPL
RRALVTGSDL PYPEGVAAAE VLKVGAGVGG EEENRKGLAA VTAGGLLAAL YPLLAKMKLA
AEEVGGVFKV GTGGSMLFGG LSLALVGVGH LVGIAVGIAM LVGIVISFGV LLPQFTTGGP
PVGTELADFV GTVFRQKIRF IGAGTIGVAA IWTLLRVIGP IVRGIASAIA ANRARKGAGI
ASLDLAERDI PIGIVGGTIL LAMVPIALLL ADFAVGGPVA AALGMTLIAS VVYVLVAGIV
IASVCGYMAG LIGASNSPIS GVGILAALGI SLLLLALFGQ ATNPDDTKAL VAFALFVTAI
VFGVATISND NLQDLKTGQL VGATPWRQQI ALVLGVLFGA LVIPPILDLL NSTFGFQGAP
GAGENALSAP QAALISAIAQ GVLGGSLDWN LVGLGAAIGA GVILVDELLK RSGNRSLPPL
AVGMGMYLPT QVTMLVIAGT VLGHLYNRWA LRQSSPELAE RMGVLTATGL IVGDSLFNIA
YAAIVAATDN PDALAVVDGF PASLPLGVAV FAAIIAYAYW RMRRDGGNPA