Gene Apar_0413 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagApar_0413 
Symbol 
ID8413262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAtopobium parvulum DSM 20469 
KingdomBacteria 
Replicon accessionNC_013203 
Strand
Start bp476774 
End bp478741 
Gene Length1968 bp 
Protein Length655 aa 
Translation table11 
GC content38% 
IMG OID645021981 
ProductGlucan-binding protein C 
Protein accessionYP_003179435 
Protein GI257784218 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAGGG TCTCAACCTC TTCTTTTACG AGAGTAGCTG CAGTGTCATT ATCTCTTCTT 
TTGATATTGA CTCTTTTTCC TCTTAATCCT GTAAAAGCTT ATGCAACAGA TCCTGCAAAA
GGTATAGAGG TTCGTGAAGA TGCTTCTAAA TTAGAGGCGA GTGTTGCAGC AGCAAAGGCA
GCTGGTGTAG ATGTTCAAAA AGATGATGAT GTTGACAAGG GTTCTGTCGA ATCGTCATCA
GATATTGATG CTAAAAAGTC TGAGATTCAA GATGATTACA ACAAACAATC TCAAGATTTA
GATGCTATTA CAGAAGAAGC TAAACAGAAG CTAGCAGACT ATGCTACAAA AAAGGCTGCT
TATGATACCG CCAAGGCTAA GTATGACGCA GACAAAGCTC AGTATGATGC AGACAAAGCT
CAGTATGATG CAGATATGAT TGCTTATAAT AAAGCTATGG CTGAGCTTGA ACAGAAAAAG
AACGAAGATG GCTATATGAC TAAGCCATAT CCGCAGCTTT TAACATTTAA ATCTGAGCCA
AATGCAGTAC TTACTCTTTC AGGCAGAAAA TATACACATG ATGAGTTTAG TGCAGAAGTT
AGATCTTGGA ATCTTGGATC TGAGCCATGG AGATATTCAT ACTTTGATGC ATTAAATAAT
GGACAGGCTG CTAATGCAGC ACGTGTTATG TTAGAAAAAG ATAAGCCTTT CACTGCTACA
TATACAAATT TGACTAATTC TAGCTATAAC GGTAAGAAGA TTTCTAAAGT TGTATATACC
TATACGTATA AGGGTTCTTC GGGAGTAAAC GTACCTAATA AGCTTCCAGT TGTCTTGCAA
AAGGACCCTA CGGTTACTAT TTGGTATAAC GATTTCTTCG GTGATGCTCG AATTAATGTA
ACTGTTAAAT TTTATGATGA AGACGGTAAT GTAATTGACC CAACTGGTTC ATTACTGAGT
TTTTCTTCAT TAAATAGAGG AAATGGATCC GGTGCAGTTG ATAAAGATGC AATTGAAAAG
GTTGGATACT TTAACGGCGA ATATGTGCCT ATTTCAGGAT CAACAATTAA ACCCCATGCT
GATGGCAGCG CGTATTCAGA TACAAATAAT GCTGAAAAAG CTTATGGTTC CAGATTTAAT
ACAGCTGACT GGGATACACC AACTTCTCCT AAAGCATGGT ATGGTGCCAT TGTTGGCCGA
GTAACAAGTC CTGAAATTAG TTTTGATATG GCCTCTCATA AGAGCGGCAT TGTTTGGTTT
GCTCTCAATT CGGATATTAA GGCAATTAAT GTGCCCACCA AACCAGTTGA GCCAACACCA
CCTACACCTC CAGCAGAAGA GCCTGAGAAG CCAACATTTA GCGCTCGATA TCATTTGGAC
GTATTCTATG TAAAGCCTCA GTTAGAGAAG AAAGCATTAA GTGAGGATGA TAAGGATATT
AATTCCAATA CAGTAAAGAC TAATTCTGTT GTCAAATTTG CATTGAATAC TACGCCTTTT
CCAGCAGGGC ATGAAAAAAT TGACTCTGTA GTTTTCCATG ATGTATTACC CGAGGGTTAT
GAAGTCAATT TGGAAGACAC AAAGAAGGCA AGTCCTGATT ATGAAGTAAG TTACGATGAA
GGTACGCGTA CGCTTGTCTT TACAGCAAAT GCTTCTTTGC TCAGTCAGAT TAATGCTGAT
CCAACTAAGG AAGCTGATGT TCCTGCTCCT GTAATCGTTG GTAAAGTTAC AAAAGATGGA
GCTGTTTACG AAAACGACTT TGATATAGAT ATTAATAATA CCTATACAAG AAGCTCAAAT
AAGGTAACGG TAAAGACACC TGAGCCTCCT AATCCGCCAA AGAAACGTAA AAAGAAGGCT
AAAACTCCAT ATACAGGAGA CGCAGGAGTA TTTTCTTCTA TTGCTCTTTG TACAGGATCA
ATAGCAGTAT TAGGTGGTTC TTGGTTTATT AAGAAGAAAA AGAAATAG
 
Protein sequence
MKRVSTSSFT RVAAVSLSLL LILTLFPLNP VKAYATDPAK GIEVREDASK LEASVAAAKA 
AGVDVQKDDD VDKGSVESSS DIDAKKSEIQ DDYNKQSQDL DAITEEAKQK LADYATKKAA
YDTAKAKYDA DKAQYDADKA QYDADMIAYN KAMAELEQKK NEDGYMTKPY PQLLTFKSEP
NAVLTLSGRK YTHDEFSAEV RSWNLGSEPW RYSYFDALNN GQAANAARVM LEKDKPFTAT
YTNLTNSSYN GKKISKVVYT YTYKGSSGVN VPNKLPVVLQ KDPTVTIWYN DFFGDARINV
TVKFYDEDGN VIDPTGSLLS FSSLNRGNGS GAVDKDAIEK VGYFNGEYVP ISGSTIKPHA
DGSAYSDTNN AEKAYGSRFN TADWDTPTSP KAWYGAIVGR VTSPEISFDM ASHKSGIVWF
ALNSDIKAIN VPTKPVEPTP PTPPAEEPEK PTFSARYHLD VFYVKPQLEK KALSEDDKDI
NSNTVKTNSV VKFALNTTPF PAGHEKIDSV VFHDVLPEGY EVNLEDTKKA SPDYEVSYDE
GTRTLVFTAN ASLLSQINAD PTKEADVPAP VIVGKVTKDG AVYENDFDID INNTYTRSSN
KVTVKTPEPP NPPKKRKKKA KTPYTGDAGV FSSIALCTGS IAVLGGSWFI KKKKK