Gene Apar_0221 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0221 
Symbol 
ID8413069 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp259801 
End bp260922 
Gene Length1122 bp 
Protein Length373 aa 
Translation table11 
GC content39% 
IMG OID645021789 
Productprotein of unknown function DUF1648 
Protein accessionYP_003179244 
Protein GI257784027 
COG category[R] General function prediction only 
COG ID[COG4194] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.263537 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.853287 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACATCA TTTCAATTAG TGCACAGACA ATAGTAACGT TTGTAGTTGG ATTATTTCTT 
ACTTTCATTC CGTATATAAC ACGCCATAAC GAGTGTTTTG CTGTGACGGT ACCAGTTTCT
GCTCAAAAAG ATTCACGTAT GATTTCTTTA AAGAAACGCT ATGTTGTGGA AATGCTTCTT
ACAACAATAC TAGCAACCAT TTCTTCTGTG ATAGCAGGGA AGTTACTAAC AACTAATCAA
ACTATAGCCG GCTTAACTCT TCTATATAGT GCTTTAGTGA TTCCTGCAAT TGTCTCTTTT
GTGTTAATGT TGCATGCCCG TTCTAAGGTA ATAGCCCTTA AGAAATCTGA AGGTTGGGAT
TTCGAGCAAC ATAAGATGAC TGCTAGTGTT GTCGAAAAAG ACTTCCCGAA TCCAATCTCT
TTAAGATGGA ACCTTATGTA TATTCCTATT ATTTTGGGAA CAGTTTGCTT GGGATTTGTT
CTTTATCCGA GTATGCCTGA TATGTTGCCT ATGCACGCCG ATTTCACAGG AACAATTGAT
AGCTACACGC CAAAAACGTT TGGTAGTGCC CTTGGATTTC CTGTTGCATT TGAAGTCTTC
ATGGCGGCAT GCTTTATCTT TTCTCATTGG ATGATTGTGC ATTCAAAACA TGCGGTTGAT
CCTAGTGAGC CAGCTACTTC TGCATTTTCA TATGGAGTTT TTGCTCGTGC TCAGAGCATA
TTTCTCTTCA TAATAGGCTT ACTTATAAGT GGCGGTCTTG GTGTTTTGTT TATTCTTGCA
TCAGCAGGGC GTATTAGTCT TGGACAAGTG GGATTTATCG CTGAAATTTT TGCTGTGCTT
ACCGTCGTTG GTATTTTGGT ACTTTCAGCT GTCTATGGTC AGTCAGGTTC ACGAGTATTT
AGGAAGTTAG ACCACAACGA GAACTACCTA TCAGATGAGG ATAGACATTG GAAACTTGGC
GTCTTTTATT TTAACCGTGA AGATGCAAGC ATCTTTTTAC CAAAACGATT TGGTGTTGGA
TGGACTATGA ACTTTGCACG ACCAGCTGTT TGGGTAATTA TCGTGGGTCT TATTATTTTT
CCTATAGTTT TTGTTGTACT TGTTTCTTAT TTGGCGGGGT AA
 
Protein sequence
MDIISISAQT IVTFVVGLFL TFIPYITRHN ECFAVTVPVS AQKDSRMISL KKRYVVEMLL 
TTILATISSV IAGKLLTTNQ TIAGLTLLYS ALVIPAIVSF VLMLHARSKV IALKKSEGWD
FEQHKMTASV VEKDFPNPIS LRWNLMYIPI ILGTVCLGFV LYPSMPDMLP MHADFTGTID
SYTPKTFGSA LGFPVAFEVF MAACFIFSHW MIVHSKHAVD PSEPATSAFS YGVFARAQSI
FLFIIGLLIS GGLGVLFILA SAGRISLGQV GFIAEIFAVL TVVGILVLSA VYGQSGSRVF
RKLDHNENYL SDEDRHWKLG VFYFNREDAS IFLPKRFGVG WTMNFARPAV WVIIVGLIIF
PIVFVVLVSY LAG