Gene Apar_1352 details

Gene Information

Locus tag: Apar_1352
Symbol:
ID: 8414242
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1524370
End bp: 1525632
Gene Length: 1263 bp
Protein Length: 420 aa
Translation table: 11
GC content: 44%
IMG OID: 645022954
Product: peptidase M29 aminopeptidase II
Protein accession: YP_003180367
Protein GI: 257785150
COG category: [E] Amino acid transport and metabolism
COG ID: [COG2309] Leucyl aminopeptidase (aminopeptidase T)
TIGRFAM ID:
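
The coordinate and length fields above are mutually consistent if the coordinates are 1-based and inclusive and the final codon is a stop codon. Both are assumptions here, not something the record states. A minimal Python sketch of that check, using hypothetical function names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1524370, 1525632)
print(length, protein_length(length))  # 1263 420
```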


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00264479
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.638771
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACAGACC AAGAGGTAAA GATTGCTCTT GATGCTCTGT CTGATCAGAT TGATAAGTAC 
GCTGAGCTTC TTGTGAAGAA GGGATGCGCA ATTTCTGAGG GAAGTCAGCT GGTAATTAAC
GCATCAATTG AGATTGCTGA TTTTGTTCGT AGAGTTCAAC GAGCAGCTTA TGCGGCTGGT
GCGGAATTTG TTACGGTTAT TTGGGGCGAT CAAGAGTCCT CAAGAATCAT GTATGAAAAC
GTCGATCTAG CACGTCTTTC TAAGACGCCA AGTTGGAAGA TTGAGCAGCT TAATTCTCTA
GCGGAGCAGG GCGCAGCATT TTTGTTCCTC TCATCTGATG ATCCAAACGG TTTAAAAGGT
ATTGATCCAG AGAAGCCAGC TGCTGTTTCT CGTGCAAGGA ATCTGGAATG TGACGTTTTT
AGAAATGGTA TGGATTTTGG CAAAAACGTT TGGTGTATTG CCGGTGTTCC TGCAGCTGAA
TGGGCAAAGG TAATTTTCCC AGAGCTTTCT GAGTCCGAGG CAATTTACAG GCTTTGGGTT
GCTATTTTGG ACGTTGCTCA TGCAAGCGGA GAAGATCCTC AAAGTGCTTG GGAGACTCAT
AATGCAACAT TTGAGAAGAC CAAGCGTTTC ATGAACTCCC ACCAGTTTAA AGAGCTTCGC
TATGAGTCTT CAAATGGAAC TAACCTGACT ATTGGCATGA ATCCAGGACA TCTTTGGGAA
GGTGGCGCTG GAAAGACACA GGATGGAACG TTGTTCTTCC CTAATATTCC TACTGAGGAA
GTTTTTACAA CACCAAACTA TCGCAAGGTA AACGGCACCG TTCATTCTGC GCTACCTCTG
ATTCACGCTG GTCAGATTGT TAAGAACTTT TGGTTTACCT TCAAAGACGG TGAGGTAGTT
GATTACGGCG CTGAGCAAGG CAAAGATGTT CTGACCTCTA TTGTTTCTCA AAAGGGCGGT
AAATATCTGG GAGAATGCGC CCTTATCTCT AAGAATACTC CTATTCGCCA AAGCGGTATT
TTATTCTATG ACACTCTGTA TGATGAGAAT GCCAGTTGTC ACTTGGCTCT TGGAATGGGC
TTCCCAGAGT GTCTGAAAGG CGGTCTTGAG ATGAACTCAG ATGAGCTCTT GGCCCATGGT
GTTAATCAGA GCACTACGCA CGTTGATTTT ATGATTGGTG CAGACGATTT GAATATTTGG
GGAATTTCTG ACGATGGAAA AGAAATACCA ATTTTCGTAA ATGGACAATG GGCATGGGAG
TAA
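
The GC content field is reported as a percentage. One conventional way to compute such a figure is the share of G and C bases among all bases. Whether this record used exactly that definition or rounding is an assumption, and `gc_percent` is a hypothetical helper name:

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)


# First 10-base block of the gene sequence only, not the whole gene.
print(gc_percent("ATGACAGACC"))  # 50.0
```

Pasting the full gene sequence into the call gives a value to compare with the reported figure.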
 
Protein sequence
MTDQEVKIAL DALSDQIDKY AELLVKKGCA ISEGSQLVIN ASIEIADFVR RVQRAAYAAG 
AEFVTVIWGD QESSRIMYEN VDLARLSKTP SWKIEQLNSL AEQGAAFLFL SSDDPNGLKG
IDPEKPAAVS RARNLECDVF RNGMDFGKNV WCIAGVPAAE WAKVIFPELS ESEAIYRLWV
AILDVAHASG EDPQSAWETH NATFEKTKRF MNSHQFKELR YESSNGTNLT IGMNPGHLWE
GGAGKTQDGT LFFPNIPTEE VFTTPNYRKV NGTVHSALPL IHAGQIVKNF WFTFKDGEVV
DYGAEQGKDV LTSIVSQKGG KYLGECALIS KNTPIRQSGI LFYDTLYDEN ASCHLALGMG
FPECLKGGLE MNSDELLAHG VNQSTTHVDF MIGADDLNIW GISDDGKEIP IFVNGQWAWE
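
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table by hand. Its amino-acid assignments are those of the standard genetic code, which translation table 11 shares (the two differ in permitted start codons). `translate` is a hypothetical helper, not part of any library:

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...); '*' is stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGACAGACC AAGAGGTAAA GATTGCTCTT"))  # MTDQEVKIAL
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to the same call allows a comparison against all 420 residues.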