Gene Acid345_2505 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_2505 
Symbol 
ID4069874 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp2961573 
End bp2962865 
Gene Length1293 bp 
Protein Length430 aa 
Translation table11 
GC content59% 
IMG OID637984522 
Productvon Willebrand factor, type A 
Protein accessionYP_591580 
Protein GI94969532 
COG category[R] General function prediction only 
COG ID[COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain 
TIGRFAM ID[TIGR03436] VWFA-related Acidobacterial domain 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.622363 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.185219 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTTCGC GAAGACTGGT TACTATGCAC CTTCGGAATA GTTCGTTCCC CCTCCGCGCG 
TTACCGCTCG TACTCCTAGG CGCCCTTGCG TTTGGGCAAA ACACCTCACA GCCGCAAAAT
CAGAATTCCC AACCTCCGGC AGGCCAGCAG GGCAAGTCTG ACGGCGGGCT CATCATGCCG
ATTGACGATG GTTCGCAGCC AGCGCAGGGG CAGCAGCAGC AGAATTCGGC GCAACCTCAA
CCCGGCCAGC AGGGTAAGAC GGACAATGGC CTGGTGATGC CAATCGAGAA CGGCCAGTCC
GAGGCGCCGG TTCCGAAGTC GCCGAACCAG CCGGGAGATA CGGTCAACGT TCCCGCGAGC
AGTTCGCGCG GCAACGGGCA AAATCCCGAC TCCGAAGTCG GCGGCGTGTA CACGTTCAAG
AAGCAGGTTG AAGAAGTTCG CCTGCACGCG ACCGTGGTCG ACGATCGGCA GCGGTTGATC
ACGACCCTCG ATAAGACCTC GTTTACGGTT TACGAGAACG GCGAGCCACA GCAGATTACG
TCATTCCGGC ACGAAGATAT TCCTGTCGCG CTGGGCGTTG TGATCGATAA CTCCGGCTCA
ATGAGGGACA AGCGTCCGGC AGTGAATGCG GCGACGATCA ACCTCGTGAA AGCCAGCAAT
CCAGAGGACG AGGTGTTCGT CGTGAACTTC AACGACGATT ATTATCTCGA CCAGGACTAC
ACCGACAGCG TTGCGAAACT GAAAGAGGCA CTGGAGAAGT ACGAGACCCG TGGTGGCACG
GCGTTGTACG ACGCGGTGCT GGCCTCGAAC GCGCACTTGA TGAAGGCTCC GAAGCTGGAG
AAGAAGGTTC TGTTCATCGT TACGGACGGT GAAGACGATG CCAGCCTCAA TACGTTGGAG
CAGACGATCC GCAAGGTGCA GCAGGAGAAC GGGCCGACGA TTTACACCAT CGGAATTCTG
GATGAAACCG GTGGGCATAA GCGTCGCGCG CAACGTGCAC TTCGTGAGAT GGCGGAATCC
ACCGGTGGCG TGGCGTTCTT CCCGCAGAGC CTCGACGAAG TGAGCCGGAT CACGCAGCAG
ATCGCGCACG ATATCCGCAA CCAGTACACG ATTTCGTACA AGCCGACGAA TCCACAAGCG
CGTGGTGGCT ATCGCCAGGT GAAGGTAGAG GCGAAGTCGA AGGGCTTCAA GGCCCTGCAG
GTTCGGACGC GCGCGGGCTA TTACGCGGGA CAGACGCAGT CCGCGGCCAA TCCGCCGGTG
CGTAAGCCGG AGACCAACGC GGCTGTGCGA TAA
 
Protein sequence
MFSRRLVTMH LRNSSFPLRA LPLVLLGALA FGQNTSQPQN QNSQPPAGQQ GKSDGGLIMP 
IDDGSQPAQG QQQQNSAQPQ PGQQGKTDNG LVMPIENGQS EAPVPKSPNQ PGDTVNVPAS
SSRGNGQNPD SEVGGVYTFK KQVEEVRLHA TVVDDRQRLI TTLDKTSFTV YENGEPQQIT
SFRHEDIPVA LGVVIDNSGS MRDKRPAVNA ATINLVKASN PEDEVFVVNF NDDYYLDQDY
TDSVAKLKEA LEKYETRGGT ALYDAVLASN AHLMKAPKLE KKVLFIVTDG EDDASLNTLE
QTIRKVQQEN GPTIYTIGIL DETGGHKRRA QRALREMAES TGGVAFFPQS LDEVSRITQQ
IAHDIRNQYT ISYKPTNPQA RGGYRQVKVE AKSKGFKALQ VRTRAGYYAG QTQSAANPPV
RKPETNAAVR