Gene Oter_1004 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagOter_1004 
Symbol 
ID6204803 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameOpitutus terrae PB90-1 
KingdomBacteria 
Replicon accessionNC_010571 
Strand
Start bp1234459 
End bp1236270 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content66% 
IMG OID641690627 
Productgeneral substrate transporter 
Protein accessionYP_001817892 
Protein GI182412826 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00016386 
Fosmid HitchhikerNo 
Fosmid clonabilitydecreased coverage 
 

Sequence

Gene sequence
ATGTCCACCT CGCCTGCACC TGTTGTGATC CCCGACAACC GCAAGAGCGG ATACAATCGC 
TTCCTGCTGC TCGTCGCCGG CCTGGGCGGC CTGCTCTACG GCGTCGATGT CGGCATCATT
GCCGGCGCCT TGCCGTATCT TGAGGCCACA TCCGGGCTCA ATGCCGGCCA GCTCTCGTTC
ATCGTCGCCG CGGTGCTGCT CGGCAGCGTC ATCTCGACGC TGTTCGCCGG ATTGCTCGCC
GACTGGCTTG GCCGCAAACC GCTGATGATC GCGAGCGGCG TGCTCTTCGT CGCGAGCATC
CCGATGATCG CTCTCGCCGA CGGCTACGTG CCGCTGGTGC TGGGCCGGCT GCTGCAAGGT
GTCAGCGCGG GCCTGATCGG GGTCGTGGTC CCGCTGTATC TCGCGGAGTG TCTGGGCGCC
GCTCACCGCG GGAAGGGCAC CGCGATTTTC CAATGGTTGC TGACCCTCGG CATCGTCGCC
GCCGCGGCCA TCGGCATGTA CTTCAGCATC CGCGTCGAGG ACGTCGCGCG GCTCGGCTCG
CCCGCGGAAC TGTTCGCCTT CAAGGACCAG GCGTGGCGCA GCATCTTCTG GGTCTCGCTG
CCACCCGGCG CGCTCTTCGT GCTCGGCGGT TTTCTGGTGG CGGAGTCTCC GCGCTGGCTG
TTCCGCCGCG GCCGCACCGA CGCCGCACGC ACCGCGCTGC TGCGTTCCCG CTCCGACGAG
CAGGCGACGA TCGAGCTGCA GGAAATGGCC GCGACGATCG CCGCCGAAAA AGCTCAGGCC
GCCACCGCCG GTGCGCGCGT GCGCGAATCA CTGCTGCGCC GCAAATACGT CGTGCCCTTC
GTGCTCGCCT GCGTGATCCT CGCCTGCAAC CAGGCCACCG GCGTCAACTC GATCATCGGT
TACAACACGA CGATCCTGCT GCAGAGCGGC CTGTCCGACG TGCAGGCCCA CTGGGGCTAC
CTCATCCTCA CGCTCGTCAA CTTCCTCGTC ACGATCGGTG CCGTCGTGCT GGTCGATCGC
AAGGGCCGGA AGTTTCTCCT CTCGCTGGGC AGCGCCGGCA TTATCGTGTC GCTGGTGTTC
GTCGGGCTGA TGTTCCGCCA ACCGGAGTCG CGGCGCGTCG ACGTGCGCGA AGCGGTCCAA
GCCACCGTGT CGATCGAACA AACAGCGACA GTGCCGTTCA ACACTGCGAC GCTGCCAACG
CTGCTGGCGA CCGCCGGTGA AGCCGGCCGG GCGATCGCAC GCGGTCCGGC GACACTGGTG
GTGATTTATT CCTACGGCGA CTTTCGCACC GCCACCAAGG CCGTGCGCTC CGATGACCCC
GCCGCGGCGC CGCTTGAGCT GACCCGCGCC GGTTGCGTGC CGCCCAACAA GGTCGTCGCC
TTCTTCTCGA ATCCGTTCGC CGATCTCGCC ACCGCCCGCA CCGCTCCGCT GAAAATCGAG
AATGCACTGA TCACACCGGT GCCGACGGAG CGTCACGGCT GGCTGACGGC CATCGGCATT
TTCACGTTCA TGGCGTTCTT CGCCGTCGGC CCGGGCGTGT GCGTCTGGCT CGCGTTATCC
GAGCTGATGC CAACGCGGAT CCGTTCCAAC GGGATGAGCA TCGCGCTGCT CATCAACCAG
GGGGTGTCCA CGACCATCGC GGCCGTATTC CTGCCCACCG TCGGCCGGTA CGGCTACTCG
ACGATCTTTT TCCTCTTCGC CGGTTGCACC GTCGTCTACT TCGTCACCGC CACGTTCCTC
TTGCCGGAAA CCAAGGGCAA GACGCTCGAG GAAATCGAAG CTCACTTCAG TCGCCGCGGA
AAAAAGGCCT GA
 
Protein sequence
MSTSPAPVVI PDNRKSGYNR FLLLVAGLGG LLYGVDVGII AGALPYLEAT SGLNAGQLSF 
IVAAVLLGSV ISTLFAGLLA DWLGRKPLMI ASGVLFVASI PMIALADGYV PLVLGRLLQG
VSAGLIGVVV PLYLAECLGA AHRGKGTAIF QWLLTLGIVA AAAIGMYFSI RVEDVARLGS
PAELFAFKDQ AWRSIFWVSL PPGALFVLGG FLVAESPRWL FRRGRTDAAR TALLRSRSDE
QATIELQEMA ATIAAEKAQA ATAGARVRES LLRRKYVVPF VLACVILACN QATGVNSIIG
YNTTILLQSG LSDVQAHWGY LILTLVNFLV TIGAVVLVDR KGRKFLLSLG SAGIIVSLVF
VGLMFRQPES RRVDVREAVQ ATVSIEQTAT VPFNTATLPT LLATAGEAGR AIARGPATLV
VIYSYGDFRT ATKAVRSDDP AAAPLELTRA GCVPPNKVVA FFSNPFADLA TARTAPLKIE
NALITPVPTE RHGWLTAIGI FTFMAFFAVG PGVCVWLALS ELMPTRIRSN GMSIALLINQ
GVSTTIAAVF LPTVGRYGYS TIFFLFAGCT VVYFVTATFL LPETKGKTLE EIEAHFSRRG
KKA