Gene Apar_0756 details


Gene Information

Locus tag: Apar_0756
Symbol:
ID: 8413621
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 837579
End bp: 838775
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 51%
IMG OID: 645022338
Product: NLP/P60 protein
Protein accession: YP_003179776
Protein GI: 257784559
COG category: [M] Cell wall/membrane/envelope biogenesis
              [S] Function unknown
COG ID: [COG0791] Cell wall-associated hydrolases (invasion-associated proteins)
        [COG3883] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
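
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 837579
end_bp = 838775
gene_length_bp = 1197
protein_length_aa = 398

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codon_count = gene_length_bp // 3
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("gene length is a multiple of 3:", gene_length_bp % 3 == 0)
print("protein length matches:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks hold for this record: 838775 - 837579 + 1 = 1197, and 1197 / 3 - 1 = 398.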


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.640011
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCACAGC GCCGATCTTT CACAAGACGA GACGCTCTCT TATTTGCAGG TCTTGGTGTA 
GCTGCTACGC TGCTTACTCC AGCTAGGCTA TTCGCAGACC CACAAAGCGA TCTTGAGGCC
GCATCTGCCC AGTTAGACTC ACTTGGTGCA GCTCTTGCAG AAGCCATGGA TAACCTTAAC
GAGAAAACCT ATGCACTTGA CGCTACCAAC AACAAGATTG GTGAAGTTCA AGAGCAGATT
GCAGAAACCA CAAACCAGCT CAATCAGCAG CGCCTGGTCC TTTCTGCAGC AATGAGGAGT
GCTTATAAAG CAGGCCCACA GGAAACTTTG GACTTCCTCC TTGGAGCATC AAGCCCAGAG
GACTTTGTCA GCCGTGTCTA TTATATGGAC CGCACTAGCA AGCAAGAGGC TGACTCCATT
AATACGGTTA AGACTCTTGG CGATCAACTT CAGGCTCAGC AGCTTGAGCT TCAAGCCGAG
CAAGAGAATC TCCAAGCTCA GGTCGCAGAA ATGAAAACAA CCGCAGATGG TCTTCAAAGC
CAAGTTGCGG AAGCTAAGGC TTACTATGAC TCTCTTGATG CAGAGGTAAA AGCACAGCTT
GCCGCTCAAG AGGCTGCATC AGCAAATAAC AACGTTGCTT ATGCAATTGA GACTGTTACT
CGTGAGAACC CCTCTAACAG CTCTGAGTCC AACGATAGCC CCTCTAACAG TTCAAATTCA
AACTCCAGCT CTTCGAGCTC TAGCAACTCC AGTAGCAGTT CTAACTCTGG TAGTGGCTCT
AACTCTAGTA GTGGGTCTGG TTCAGGCAGC GGATCACACT CCAGCGGTGG TGGCGGTGGT
TATCCAGCCG CAGGCGGCGG CGTTGCCACT GCTTATGCTT GCATTGGCTA TCCATACGTT
TGGGGTGGAG CTTCTCCTGC TTCTGGCTTT GACTGCGGTG GTCTGGTCTA TTACTGCTTC
CTTGGTTATC GCAAAGGTAC TGCAGGAACC ATTGGACGCG CCATTCGTGC TGCTGGTAAC
TGGCACGATT CCATGGATGA GCTCAATTAT GGTGACATCA TCTTTACCCG TGCAGGCTAC
GAGCATGTTG GAATCTATAT CGGCGGTGGT CGCATGATTC ATGCAGCCAA CGAGTCTGTC
GGCGTCATCG AGGGTCCTGT TTACGCTTGC TATGGTGGAG GACCATTCTC TGGCTAA
 
Protein sequence
MSQRRSFTRR DALLFAGLGV AATLLTPARL FADPQSDLEA ASAQLDSLGA ALAEAMDNLN 
EKTYALDATN NKIGEVQEQI AETTNQLNQQ RLVLSAAMRS AYKAGPQETL DFLLGASSPE
DFVSRVYYMD RTSKQEADSI NTVKTLGDQL QAQQLELQAE QENLQAQVAE MKTTADGLQS
QVAEAKAYYD SLDAEVKAQL AAQEAASANN NVAYAIETVT RENPSNSSES NDSPSNSSNS
NSSSSSSSNS SSSSNSGSGS NSSSGSGSGS GSHSSGGGGG YPAAGGGVAT AYACIGYPYV
WGGASPASGF DCGGLVYYCF LGYRKGTAGT IGRAIRAAGN WHDSMDELNY GDIIFTRAGY
EHVGIYIGGG RMIHAANESV GVIEGPVYAC YGGGPFSG
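
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is a minimal Python example, not the tool that produced this record. It assumes the standard codon-to-amino-acid assignments (to my knowledge translation table 11 shares these and differs mainly in permitted start codons), and the helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Only the first row of the gene sequence is embedded here; pasting in the full sequence would allow comparison with the listed GC content and protein length.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G; third position varies fastest).
# Assumption: these assignments also apply to translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and the first 20 residues of the protein, as listed above.
first_row = "ATGTCACAGC GCCGATCTTT CACAAGACGA GACGCTCTCT TATTTGCAGG TCTTGGTGTA"
protein_prefix = "MSQRRSFTRR DALLFAGLGV"

print(translate(first_row) == clean(protein_prefix))
```

Stopping at the first stop codon mirrors the record, where the final TAA codon of the gene sequence has no counterpart in the protein sequence.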