Gene Apar_1261 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_1261 
Symbol 
ID8414140 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp1414174 
End bp1415148 
Gene Length975 bp 
Protein Length324 aa 
Translation table11 
GC content39% 
IMG OID645022853 
Productsortase family protein 
Protein accessionYP_003180277 
Protein GI257785060 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3764] Sortase (surface protein transpeptidase) 
TIGRFAM ID[TIGR01076] LPXTG-site transpeptidase (sortase) family protein 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAATC CAAAACACAT GATGGCTGTA TTGAAAGCAG GAGAATCAAA CGATTCTCCT 
GCTTCTCTTA ATGCCTCGGA TAAAGGTATA GAAAAAAAGA AGCGTGCGCG CGTATGGCTA
ATTATGTCGA TTATTGTTTT GGTTGCTGGT GTTGCGTTAA TTAGCTATCC ATTTGTAAGT
AACTGGCTTA ACCAACTTAC TCAAAATAAT GTGAGTGCTA CCCAAGAAAA TACAGTAGTT
ACAATGTCTA AGACAGACCT TTCTTCTGAA AAGGAAAGGG CAATTGAATT TAATAAGCAC
CTTCGTGATG GTGCTTCTAG GGTTATTGAC CCCTTTGATA GCAAAGAATC TATGCCAGGG
GTTACTGAAT ATAAAGAAGT GCTGAACATA GCAAATGATG GTGTTATGGG AGAGCTTATA
ATACCAAAAA TTTCAGTGAA TCTTCCTATT TATCACTTTA CAACTGATGA TGTGTTACAG
CATGGTGTTG GTCATGTAGT AAATACTTCT GTGCCAATAG GCGGGGAGTC AACGCATACT
GTTTTAGCAG GTCATACCGG TCTGCCTACC GCCCGTATAT TTGATCGACT CAATGAACTT
CAAGCAGGTG ATTGGTTTAT TATTCATGTT CTTGGGGAAG ATCATGCGTA TAGAGTAACT
TCTACTGAAG TAGTTTTACC TAATCAGGTA GATAGTCTTT TTATTGAGCC AGGGAAAGAT
CAAGTAACGC TTGTAACGTG TACTCCTTAT GGTGTTAATA CTCATAGGTT ATTAGTACAT
GCAGAGCGCA CGGATGTTCC TGCCGAATGG AACGATCAAA ATGAGTTGAC TAATCGTTCC
ATTGATTCTT CGGTAGATAT GAGTCGACAC CCAATATTAT TCTCTATTTT GGGTATTGTA
TGTGCTGGTG TAGTTGTCAG TATTGCGGCT TTTATCGCTA AACGTACAAA AGTTTTTTCA
AAAATAAAGA AGTAA
 
Protein sequence
MANPKHMMAV LKAGESNDSP ASLNASDKGI EKKKRARVWL IMSIIVLVAG VALISYPFVS 
NWLNQLTQNN VSATQENTVV TMSKTDLSSE KERAIEFNKH LRDGASRVID PFDSKESMPG
VTEYKEVLNI ANDGVMGELI IPKISVNLPI YHFTTDDVLQ HGVGHVVNTS VPIGGESTHT
VLAGHTGLPT ARIFDRLNEL QAGDWFIIHV LGEDHAYRVT STEVVLPNQV DSLFIEPGKD
QVTLVTCTPY GVNTHRLLVH AERTDVPAEW NDQNELTNRS IDSSVDMSRH PILFSILGIV
CAGVVVSIAA FIAKRTKVFS KIKK