Gene Apar_0506 details

Gene Information

Locus tag: Apar_0506
Symbol:
ID: 8413357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 582659
End bp: 584017
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 49%
IMG OID: 645022076
Product: putative aminopeptidase 2
Protein accession: YP_003179528
Protein GI: 257784311
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1362] Aspartyl aminopeptidase
TIGRFAM ID:
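
The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based inclusive positions and the last codon of the CDS is an untranslated stop codon. The short Python sketch below checks that arithmetic. The inclusive-coordinate convention and the variable names are assumptions made for illustration; the record itself does not state them.

```python
# Values copied from the Gene Information fields.
start_bp = 582659
end_bp = 584017

# Assumption: both coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not
# counted in the protein length.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # prints: 1359 0 452
```

Under these assumptions the computed values match the listed Gene Length (1359 bp) and Protein Length (452 aa).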


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0233423
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.547995
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCGGATT TACATGAGTC ACTTGCTCTT TCTGAGGAGC TTCTTGCTTT TATTAAGCAG 
AGTCCTTCCA TGTTTCATAC TACACAAACT ATCAAAGACT ACCTGTTAGA GAACGGTTTC
ACTTACCTCT CTGAGGGTTC TTCTTGGGAT GTTCAGCCAG GCGGCTCTTA CTTTACAACA
CGCAACAATT CTTCAATTAT TGCCTGGAAA GTTGGCGAGA AATACCGTGA GGCTCAAACC
TCAAACGCTG ATACTCCTTA TCACTTCCAG CTTGCTGTTG CCCATGGCGA TTCTCCAACT
TACAAGGTAA AAGCCCAGCC AGAGCTTACT GGCGAGGGCA ACTCGCTTCG TCTGAACACT
GAGGCATACG GCGGCATGCT TGACCACACG TGGTTTGACC GTCCTTTGGG TGTTGCTGGC
CGTGTGCTGG TCAAGGTAGG AAACAAGGTA GAGTCCAGGC TGGTCAACAT CGAAGATGAC
GTTGTCATGA TTCCAAGCTT GGCTATTCAT CTTGAGCACA AAAATGGTCT CTCGCCAGAG
TTCAACCGTG CTAAAGATCT GATGCCACTT TTCAGCGTTG GAGAGCTCAA TCCCGGCGCC
TTTAACGCCC TGGTAGCAGA TGCAGCAGGT GCGTCTCAAG AGGACATTCT TTCTCGCGAT
CTCTTTTTGG TTGATCACAC AGGTGGTCGT ATTTGGGGCG CAAAGAAGGA GTTTGTTTCC
GCTGGTCATC TGGATGACCT GCAGTGTGCC TTTGTAGCAC TTAAAGCTTT CCTTGCGTCT
TCAAATGAGC AGGACATCTC TGTGTACACC TGCTTTGACA ACGAAGAAGT TGGCTCAAAC
ACTAAGCAGG GTGCTAAGTC TACGTTCCTT AAAGACACGC TACAGCGCGT AAACGCTACG
CTTGGCTTTA CGCAGGAAGA TTACTACCGT GCGCTCTCGG CATCTTTGCT AGTAAGCTGC
GACAACGCTC ATGCGGTGCA TCCCAATTAT CCTGAGAAGC ACGATGCGGC CAACAAACCT
TACCTCAACG GAGGTATGGT TATCAAGGAA GCAGCACGTC AGTCATACTG CACGGATGCG
TTTAGCCGCG CCATTGTCGA GGCAATTTGG AAGCAGCAAA ACGTTCCATA TCAGATTTTT
GCTAATAGAA GCGATATGCC AGGTGGATCT ACTTTGGGCA ACCTCTCCAA CATTCAGGCC
AGCATGCATG CCGTTGACGT GGGTCTGCCT CAGCTTGCTA TGCACTCTGT TTACGAAACC
GCGGGCACTA AAGATACACT TTTGGGGTAC CAGGCACTTA AGGCGTTCTA TGACACCTGC
GTCTGCATTA CTGATGCCGA TTCGTTTGAG TTGAGGTAA
 
Protein sequence
MSDLHESLAL SEELLAFIKQ SPSMFHTTQT IKDYLLENGF TYLSEGSSWD VQPGGSYFTT 
RNNSSIIAWK VGEKYREAQT SNADTPYHFQ LAVAHGDSPT YKVKAQPELT GEGNSLRLNT
EAYGGMLDHT WFDRPLGVAG RVLVKVGNKV ESRLVNIEDD VVMIPSLAIH LEHKNGLSPE
FNRAKDLMPL FSVGELNPGA FNALVADAAG ASQEDILSRD LFLVDHTGGR IWGAKKEFVS
AGHLDDLQCA FVALKAFLAS SNEQDISVYT CFDNEEVGSN TKQGAKSTFL KDTLQRVNAT
LGFTQEDYYR ALSASLLVSC DNAHAVHPNY PEKHDAANKP YLNGGMVIKE AARQSYCTDA
FSRAIVEAIW KQQNVPYQIF ANRSDMPGGS TLGNLSNIQA SMHAVDVGLP QLAMHSVYET
AGTKDTLLGY QALKAFYDTC VCITDADSFE LR
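
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The Python sketch below is one minimal way to do that without external libraries. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted), and it uses only the first and last lines of the gene sequence shown above. The names CODON_TABLE, clean, translate and gc_fraction are illustrative and do not come from the record.

```python
from itertools import product

# Codon-to-amino-acid mapping in TCAG order; '*' marks a stop codon.
# Assumption: table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGTCGGATT TACATGAGTC ACTTGCTCTT TCTGAGGAGC TTCTTGCTTT TATTAAGCAG"
last_line = "GTCTGCATTA CTGATGCCGA TTCGTTTGAG TTGAGGTAA"

print(translate(first_line))  # prints: MSDLHESLALSEELLAFIKQ
print(translate(last_line))   # prints: VCITDADSFELR
```

Both outputs match the start and end of the protein sequence above. Passing the full gene sequence to gc_fraction gives a value that can be compared with the listed GC content.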