Gene Apar_1355 details


Gene Information

Locus tag: Apar_1355
Symbol:
ID: 8414246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Atopobium parvulum DSM 20469
Kingdom: Bacteria
Replicon accession: NC_013203
Strand:
Start bp: 1526910
End bp: 1529165
Gene Length: 2256 bp
Protein Length: 751 aa
Translation table: 11
GC content: 43%
IMG OID: 645022958
Product: Cna B domain protein
Protein accession: YP_003180370
Protein GI: 257785153
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG4932] Predicted outer membrane protein
TIGRFAM ID:
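
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The variable names are illustrative.

```python
# Values copied from the Gene Information fields above.
start_bp = 1526910
end_bp = 1529165

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and encodes no amino acid.
stop_codon_bp = 3
protein_length_aa = (gene_length_bp - stop_codon_bp) // 3

print(gene_length_bp)     # 2256
print(protein_length_aa)  # 751
```

Both results match the listed Gene Length (2256 bp) and Protein Length (751 aa).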


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000416795
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTATCA CAAAATCCTC AAAACCTAAG GAAAAAATAA CTGCGCAGAA ACGCTCTCTT 
GCTTCTGCTT TTCACTTTTG GTTTATTCCC CTATTGATTG GAGGACTTTT ACTTAGCCAT
CTTTTACTAA AGCCTGCTCA TAAACTTGCC TTTGCCCAAG ATCCTTCTTT TACTGTTGCC
CAAAGAGAAG ACTTCCCCGG TGGACAACAA ATTCCCATGT GGACTGCTAG CGATGGCACT
TATCTCTACT GTGGAGACGA GTTTAATCAC TACGGCGCAT GGCCAGGAGC AACTGGAAGT
ATAGTTGACC CACAATCTTA TACAAAACTT ACTTCAGCTA CCGGCGCTCG TGGTGGAACT
TATACAGATG AGCAACTTCG AGCTATTGAC TACATCATTT ATCACGGAGC TACTGCTTCC
CAAGAGAAAG ACGTTTATGG CTATACGGGC TGGAAGGCGC GAGCTATCAC ACAGTTTGCG
CTTTGGGCTG TTATGCGCGG TGAAGCTCAT ACCCTTGCGG TTAGCGTCCC TCAAGAAGAA
CTCCATCAAC CAATCGAGAG ATTCTACACC GATGCTGTGA ACTACGCCCA CAATAACAGC
GGCGGTCCAG AGAACGGGAT TGCAAAGCTT TTTGTACCCG CAGGTGACAA ACAGGTCCTA
TTTTTCTTCG GAGAACAATC TGGCTCTCTC AAGATTACTA AAAACTCCTT ATTGCCTGCT
ATTACTTCCA ATAATGAGCA TTACGCCTTA GAAGGCGCTG TCTACGAAGT ATATTCCGAT
GAAGGTTGTA CCAATCTTCT TGGTAGTCTC ACACTTGATG CATCTGGATC CGCAACAATC
GACGGACTGC CTGTTGGATG TGTATATGTA AAAGAAACCT CTGCTCCAAA AGGTTTTCTT
CTTGATCCAA CCGTTCACAC CGTAGAAATA AAAAACCAAG AAGAAAGCAC TCTCGCGGTT
ACCGATACTC CTATCGGGGA CTTCAATCTA CACATTTCAA AACAAGACTT CGACCACAGT
GCCAACCTCG ATGAGGGCAA CCAAGCAGAG CAACAGGGTA CTTCTCCCCA AGGAAACGCA
ACGCTACAAG GTGCACTTTT TAAAGTAACC TATAGCGGAT CCTCCGAAAC AACGTTAAGA
ACATGGGTTT TTTCGACCGA CGCTCAAGGC TTTACTTCTT TTGATGCAGA CCACAAGGTT
TCTGGAGATG ACCTCTTTAC TCTTGACGAG AAACCCTGGC TTCCTCTGGG GTATTATCAA
ATCGAAGAAA TTCAAGCACC TAAAGGTTAT AAACTTCCAG AACTTTCATT TCAAACTTGG
AAACTATCAA GCCAAAATGG GAATCTGGTC TGGACAAACG TTTCTAGTGG AAAAGAGAGT
TCTTCGTCTG AGCACTCCTT TATTTTTAAA GATGAAGTAA TAAGAGGGAA TCTTAAGATT
AAGAAAATTG GACACACTTC TCTAAGTTCA TCCGATGGTT ATTCTGAAAT AAAAGAAATG
CCAAGTCTAA AGGGTGCCAA AATTGAACTT ACTAATAACT CCACTCAACC TGTTTTTTAT
CAAGATAAAT GGATTGCTCC TCACGAAGTT GTCACTACAG TAGAAACAGA TGAATCTGGC
GTTGCTGCAA TTAAAGACCT TCCATTTGGT TCCTATTCAC TTAAAGAAGT ACTAGCTCCA
GCAGGTTACT CTCTCAATAC AGAATGGAAT CCAACGGTTA CCCTTACTTC TGAAGAGACT
ATAGAAGCGC CCGAACTCAT TGATGAGAGA ATTGCTCTAC AAACAATGCT TGTAGACACT
TCGGGATCCA AAACTCCCAA ATATACTGAA ATATTAAATC TTGTTGATCA CATCAAATAT
GAAGGTCTCA CTCCAGGAGA AGAGTATGAA ATTACTGGAG AACTCTATGA AACAAAACAA
GTCCAAGAAG GTGCAGCAGA ACCCATCGCT CGCGGGACTG TCCGTTTTAA AGCTTCTACC
TCATCGGGAG AAGCCGCTGT TCCTTTTTCT GTAAGAACTA CTTCTCTTGA GGGTAAAGAA
GTTACTGCCT ACGAAACAAT CTCAAAAGAT GGAGAAAAGG TTGCTTCACA TACCGACAGC
CACTCTGAAG CTCAGACTAT TCGTGTAGCC CCTAAGCCCC ATCTTCCCGA AACGGCAGAT
AATGCTTACG AAATTCCTCT TTTATTCGCT CTGGCAGGCG CGTTACTCAT TGGATGTACT
CATTTATTTG CTACAAAAAT AAGACGGCTT TTTTGA
 
Protein sequence
MFITKSSKPK EKITAQKRSL ASAFHFWFIP LLIGGLLLSH LLLKPAHKLA FAQDPSFTVA 
QREDFPGGQQ IPMWTASDGT YLYCGDEFNH YGAWPGATGS IVDPQSYTKL TSATGARGGT
YTDEQLRAID YIIYHGATAS QEKDVYGYTG WKARAITQFA LWAVMRGEAH TLAVSVPQEE
LHQPIERFYT DAVNYAHNNS GGPENGIAKL FVPAGDKQVL FFFGEQSGSL KITKNSLLPA
ITSNNEHYAL EGAVYEVYSD EGCTNLLGSL TLDASGSATI DGLPVGCVYV KETSAPKGFL
LDPTVHTVEI KNQEESTLAV TDTPIGDFNL HISKQDFDHS ANLDEGNQAE QQGTSPQGNA
TLQGALFKVT YSGSSETTLR TWVFSTDAQG FTSFDADHKV SGDDLFTLDE KPWLPLGYYQ
IEEIQAPKGY KLPELSFQTW KLSSQNGNLV WTNVSSGKES SSSEHSFIFK DEVIRGNLKI
KKIGHTSLSS SDGYSEIKEM PSLKGAKIEL TNNSTQPVFY QDKWIAPHEV VTTVETDESG
VAAIKDLPFG SYSLKEVLAP AGYSLNTEWN PTVTLTSEET IEAPELIDER IALQTMLVDT
SGSKTPKYTE ILNLVDHIKY EGLTPGEEYE ITGELYETKQ VQEGAAEPIA RGTVRFKAST
SSGEAAVPFS VRTTSLEGKE VTAYETISKD GEKVASHTDS HSEAQTIRVA PKPHLPETAD
NAYEIPLLFA LAGALLIGCT HLFATKIRRL F
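
The gene and protein sequences above can be cross-checked by translating the DNA. The following is a minimal sketch, not the database's own method. It builds the codon-to-amino-acid mapping used by translation table 11 (which assigns the same amino acids as the standard code) and translates the first and last lines of the gene sequence. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and introduced only for illustration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11
# assigns the same amino acids to codons as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First and last lines of the gene sequence listed above.
first_line = "ATGTTTATCA CAAAATCCTC AAAACCTAAG GAAAAAATAA CTGCGCAGAA ACGCTCTCTT"
last_line = "CATTTATTTG CTACAAAAAT AAGACGGCTT TTTTGA"

print(translate(first_line))  # MFITKSSKPKEKITAQKRSL
print(translate(last_line))   # HLFATKIRRLF
```

The two outputs match the start and end of the listed protein sequence. The last line starts in frame because the 37 preceding lines hold 60 bases each, and 2220 is divisible by 3. Passing the complete gene sequence to `gc_fraction` would give a value to compare with the listed GC content.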