Gene Apar_1023 details

Gene Information

Locus tag: Apar_1023
Symbol:
ID: 8413896
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1157517
End bp: 1159499
Gene Length: 1983 bp
Protein Length: 660 aa
Translation table: 11
GC content: 41%
IMG OID: 645022613
Product: membrane protein-like protein
Protein accession: YP_003180043
Protein GI: 257784826
COG category: [S] Function unknown
COG ID: [COG4907] Predicted membrane protein
TIGRFAM ID:
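
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the source record. It assumes that the start and end positions are 1-based and inclusive, and that the CDS is complete with a single terminal stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1157517
END_BP = 1159499
GENE_LENGTH_BP = 1983
PROTEIN_LENGTH_AA = 660


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming a complete CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons hold for the values in this record.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 1159499 - 1157517 + 1 gives 1983 bp, and 1983 / 3 - 1 gives 660 residues, which agrees with the listed lengths.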


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATAAAA TCTCGCTTCG CTCTTCTAAA ATGAGCCAGT TTGTAAGGAT TGCACTTTGC 
ATAATGCTGT TGGCTGTTGC GTTTCTAATT GCTCCAACAA AAGCATTTGC ACGTGATCTT
TCAATAGATA GGGTAGATAT TGATGCGACG GTTCAAAAAG ACGGAACGTT GCATGTTGTA
GAAACTCGTA CATTTTATTT TAAAGGAAGC TATCACGGTG TTTATTGGAA TCTTCCAGTT
GGAAAGAATA AATACAATGG ACAGAACGTT GAAATAAATA TCACTTCAGT TACTGTTCAA
GATAGTCAAG GATCACGCCA GATTTATAGT AAAAACTCTA GTGGAAACAG CGCCAACTCA
GATGAGTATT ATGTTCCTAC TCAATCAAGC GATATGTTGA ATTTGAAAAT TTATTCAGCA
CATGATGATG AATCTGCAAA TATAACTATC GAGTATGACA TTACTAATGT TGTAACAAAT
TGGTCTGATA CTGCAGAGTT GTATTGGAAG TTTGTATCTG ATGGTTGGGA TTCAGAGTCT
CATAATGTAA CAGCTACCAT CCATCTTCCG GTTCCTTCAG GTCAATCGAT CGTTGCTGGA
GATACTGTCC GTGCTTGGGG ACACGGCCCG CTGGATGCAA ACGTAGAAAT TACTAAGAAT
GCAGTTGTTT ACAAAGTTCC TGGTGTAGGA ACTGACGAGT TCGCGGAAGC TCGTATTACG
TTCCCAACAG ATTGGGTTTC TGGGCTAGCT CCTATTCAGC AGAATAAATT AAGCAGCATT
CTATCTGAGG AACAAGCATG GGCAGACGAA GCAAATCACA AGCGTGAGCT TGCAAAAATT
CAGGTAGCCC TCATACTTTA TGCGCCGTTA GTATTTGCAG CTATCACCTT TGTGGCAGCT
ATTCTGCTAC GCATTAAATA TAAAAAGGTC ATGGCTCCTC AGTTTAATGA CCTTTATTAC
AGGGACGTTC CATCTAATGA TCATCCAGCA GTTCTTTCTA TGCTTTATAA CGGTGAGGGA
ATTAAGGGAG ATGCTCTGAC GGCAACTCTT ATGCACCTTT CTGATGCAAA GTACATTAGT
TTGAATAAGA CAGTTAATAC AAACTTCCTT GGTATGGAAA AGTCTGATTA TTGTCTTACA
AAGGTCCAAG ACTTTCAGGA AGACCAGCCA TCTTTCTATC CTGGTAGTTC TTTATCAAAT
AAGATTGATA GCATGGCTAT GCGATTGATT TTCGATAAGG CCGGGGAATC TGGAGCGGTA
AATACTACCG TTACTATGAA ACAAATTGGC AAATATGCCT CTGATCATCC AGAAGATTTT
AATAAGGCGT ATGAGTCATG GGAGAGCCCT ATTAAGTTTA GCTATGCTGC TAAGTTTGAA
ACAAATAAGA TCCCATTCCA TGGCAAGGGA ATTCTTGGTG CTCTGATTGC TGTTGATATT
CTTGTTGCAG CTGCTGCATT CTTTATGGGT ATTGTGGCGG ATGTTTCGAT TGGTAATTTA
CTCTTTAATT TACTGCTTCT TGTTATTGCC GGTCTTATTG CGCTTATAAA CATTGGTAAG
TTTGATTTGA TGAACAGAGA GGGTGTTGAG ACGCTTGCTA AGACCCGTGC TCTAAGAAAG
TGGCTTACTG AATTTACTAA TCTTCATGAG GCAATTCCAA CCGACGTTAT TCTTTGGAAT
AGGCTTTTAG TTATGGCAGT CGCGCTTGGT GTGGCAGATA AGGTTATTAG TCAGTTGAAA
GTTGCAATGC CCGAGTTGCT TAAGGATCCA CAGTTTATGC CCGTTTATAG TTGGTATTAC
TATGGAAATG GCATGCGTGT TAATGCAATC GATAGCGTAA CCAAGAGTGT TACAGCTGCT
CATTCTGTCT CTACCGCGGC ACTTGCTTCT TCTAGTTCAA GCTCTGGTGG CGGATTTGGT
GGCGGCTTCT CCGGCGGAGG CGGCGGTGGC TTCGGCGGCG GAGGCGGCGG AGGCGGTTTT
TAA
 
Protein sequence
MYKISLRSSK MSQFVRIALC IMLLAVAFLI APTKAFARDL SIDRVDIDAT VQKDGTLHVV 
ETRTFYFKGS YHGVYWNLPV GKNKYNGQNV EINITSVTVQ DSQGSRQIYS KNSSGNSANS
DEYYVPTQSS DMLNLKIYSA HDDESANITI EYDITNVVTN WSDTAELYWK FVSDGWDSES
HNVTATIHLP VPSGQSIVAG DTVRAWGHGP LDANVEITKN AVVYKVPGVG TDEFAEARIT
FPTDWVSGLA PIQQNKLSSI LSEEQAWADE ANHKRELAKI QVALILYAPL VFAAITFVAA
ILLRIKYKKV MAPQFNDLYY RDVPSNDHPA VLSMLYNGEG IKGDALTATL MHLSDAKYIS
LNKTVNTNFL GMEKSDYCLT KVQDFQEDQP SFYPGSSLSN KIDSMAMRLI FDKAGESGAV
NTTVTMKQIG KYASDHPEDF NKAYESWESP IKFSYAAKFE TNKIPFHGKG ILGALIAVDI
LVAAAAFFMG IVADVSIGNL LFNLLLLVIA GLIALINIGK FDLMNREGVE TLAKTRALRK
WLTEFTNLHE AIPTDVILWN RLLVMAVALG VADKVISQLK VAMPELLKDP QFMPVYSWYY
YGNGMRVNAI DSVTKSVTAA HSVSTAALAS SSSSSGGGFG GGFSGGGGGG FGGGGGGGGF
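
The gene and protein sequences above are printed in space-separated blocks of 10 characters, and the record lists translation table 11. As a minimal sketch (not part of the source record), the code below strips that block formatting and translates a DNA string codon by codon. The helper names are hypothetical. The codon-to-amino-acid assignments used are those shared by the standard code and table 11, and the sketch assumes the sequence starts with ATG and contains only A, C, G, and T. For brevity it checks only the first printed line of the gene sequence against the first two blocks of the protein sequence.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order: first base varies slowest, third fastest.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate up to the first stop codon; trailing partial codons are ignored."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on non-ACGT characters
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence and first two blocks of the protein sequence.
FIRST_DNA_LINE = "ATGTATAAAA TCTCGCTTCG CTCTTCTAAA ATGAGCCAGT TTGTAAGGAT TGCACTTTGC"
FIRST_PROTEIN_BLOCKS = "MYKISLRSSK MSQFVRIALC"

if __name__ == "__main__":
    print(translate(FIRST_DNA_LINE) == clean(FIRST_PROTEIN_BLOCKS))
```

Passing the full gene sequence to `translate` in the same way would allow a comparison against the full protein listing.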