Gene Apar_1284 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1284 
Symbol 
ID8414164 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1441805 
End bp1443112 
Gene Length1308 bp 
Protein Length435 aa 
Translation table11 
GC content49% 
IMG OID645022876 
Productputative sugar-specific permease SgaT/UlaA 
Protein accessionYP_003180299 
Protein GI257785082 
COG category[S] Function unknown 
COG ID[COG3037] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.788956 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAATG CAATTCTCAT GCAGCTCAGA GATACAGCCG TTATTATGGG CATTATTGCT 
CTTGTCGGTC TCTTGCTGCA GAAGAAGTCC GCTGTGGACG TCTTCTCCGG AACTGTGAAG
ACCATTATTG GCTTCATGAT CTTCAACATT GGCTCTAGCG CTATGTCCGC TCAGGTCAAC
ATCTTCAGTG ATATGTTTAG TCGTGCGTTC GCTGTTAAGG GTGTTGTAAC CCAGGTTGAG
GTCGCAACTG CTCTTGCACT GAACACCTAT GGTACTGAGG TTGCTCTTGT AATGGTCCTT
GGCTTTATCA TGAATCTTGT GTTTGCTAAG GTCACCCCAT TCAAGTCCGT CTTCTTGACT
GGCCAGCACT TCCTGTACTT CTCCTGCGTA CTTGCGCTAG TCTTCATTGC TCTTGGCCTC
CCAATGTGGG TTACCATTGT TCTTGGTGGC GTCATTCTCG GCTTCTGTGG TGCTGCACTG
CCTTCGCTGT GCCAGCCATT TGTAACTAAG CTTGTTGGTG GAGATTCCAT CGCAATTGGC
CACTTCAACT GCATCGGTTA TGCATTCTCC GGTTACGTTG GAAAACTCTT CGCCAAGAAG
AACGAGAAGG ACAGTGAGAA AGAGGCTAAG GAGCTCCCAG AGTTCTTTAA GCTCTTCAAG
GACTTTGTCT TCTCCGTTGC TCTCTTCATG ATTGTTCTAT TCTATGTTGT TACTATTGCA
TGCCTTGTCA CCGGTCACTT TGGTGACACT CTTGCAAACA ACAAGCCATT CACTAGCTAC
TTTGGCAACG ATATGTGGTG GATTTGGCCA TTCCTTGCTG GTCTTCAGTT TGCTGCTGGC
ATGTCCGTTC TTGTTTACGG TGTTCGCCAG TTCATCGCTG AGATTACCGC TGCATTTGTC
GGCATCTCTG AGAAGCTCAT TCCAGATGCA CGTCCTGCAG TCGACTGCCC TGCAATCTTC
CCATTTGCAC CAAACGCTGT CATCATCGGC TTCATTGGAT CCTTCCTTGG CGGCCTTGTA
GCAATGGCAC TTATGGTTGC GTTCCACAGC CCAACCATCA TGATTCCTGC AGCAGGCATC
TGCTTCTTCT CTGGTGGTAC TTGTGGTGTT TGTGGTTACG CTTACGGTGG ATGGCGTGGC
GCTCTTCTTG GTTCCTTCTT GGTTGGCATT TTCCTGACCG CTGGTCCTCT GATTCTCTAT
CCTGCATTTG CTCAGCTGGG AATTGCAGAG GCATCCTTCC CTAATGTTGA CTACAACATC
GTTGGTTCTA TCATCTACGG CATCGGCTCT CTCTTTGGTC TTGCCTAA
 
Protein sequence
MLNAILMQLR DTAVIMGIIA LVGLLLQKKS AVDVFSGTVK TIIGFMIFNI GSSAMSAQVN 
IFSDMFSRAF AVKGVVTQVE VATALALNTY GTEVALVMVL GFIMNLVFAK VTPFKSVFLT
GQHFLYFSCV LALVFIALGL PMWVTIVLGG VILGFCGAAL PSLCQPFVTK LVGGDSIAIG
HFNCIGYAFS GYVGKLFAKK NEKDSEKEAK ELPEFFKLFK DFVFSVALFM IVLFYVVTIA
CLVTGHFGDT LANNKPFTSY FGNDMWWIWP FLAGLQFAAG MSVLVYGVRQ FIAEITAAFV
GISEKLIPDA RPAVDCPAIF PFAPNAVIIG FIGSFLGGLV AMALMVAFHS PTIMIPAAGI
CFFSGGTCGV CGYAYGGWRG ALLGSFLVGI FLTAGPLILY PAFAQLGIAE ASFPNVDYNI
VGSIIYGIGS LFGLA