Gene Acid345_1866 details

Gene Information

Locus tag: Acid345_1866
Symbol:
ID: 4073025
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2244516
End bp: 2246402
Gene Length: 1887 bp
Protein Length: 628 aa
Translation table: 11
GC content: 56%
IMG OID: 637983875
Product: von Willebrand factor, type A
Protein accession: YP_590941
Protein GI: 94968893
COG category: [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG5010] Flp pilus assembly protein TadD, contains TPR repeats
TIGRFAM ID: [TIGR03436] VWFA-related Acidobacterial domain
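
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts the terminal stop codon; both assumptions are consistent with the values listed, but the record does not state them. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2244516
end_bp = 2246402

# Assumption: coordinates are inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon of the CDS is a stop codon and is not
# counted in the protein length.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp, codon_count, protein_length_aa)
```

Under these assumptions the computed lengths agree with the Gene Length (1887 bp) and Protein Length (628 aa) fields.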


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGCGAA GCGTTTCCAT TTTATATTCG CTGCTTGTGC TGAGCGGATG GGCTTGTGCT 
CAGGTGCTGA GCTTTCCCGC GTCCAATCCA GCCGCCATTA CTACGGGCGC GGGGTTGAAC
CTTGTCCAAG CCACGGTAAT GGACAGCAAT ATCGCGCTGA TCGATAGCTT GAATCATAGC
CCATTAGAAA ACCCTACCAT TGCGCTTTCT AAGCTGGATC TGAAGGCACC GGGCAAAGCG
CGTCAGGAGT ATGAAAAGGG GTATCAGGCG CTCGCGAAGA AAGACTTCAC CCAGGCCTTG
GGTCATCTCG AAAAAGCGAC TGCGATCTAT CCGAGTTACG TGTCGGCGTT CAATGCGCTC
GGAGCAGCTC ATTTAGGTCT GGGTCAAAGC GATGAGGCGC GCGCAGCGTT TGCCGAAGCA
ATATCGCTCG ACGACCACCT GCCCAATTCT TATCTGAACA TGGGCTGCGC GGAACTGGCT
CTCAAGGATT ACGCCGGCGC AGAGCGAGAC ATAACACAAG CTTCTTCCAT GGCGCCTCTT
GATTTTCAGG TGAAAGCAGC CCTCGCATAC AGCCAATATA TGAACAACAA CTATCAAGCT
GTGGTTGCCA CGGCGGATGA CGTACATGCC CGCAAACACA GTGGCGCTGC GCTGGTTCAT
TTCTACGCCG CAGCTGCCTG GGATGCACAA GGAAACCCGG CGTATGCGCA GCGAGAACTC
CGGCTTCTGA TGAAAGAGGA CCCAAAATCA CCAGCGGCGA TCCAAGCGAA AAGCTTGATG
CAACAGTTGC AAGATGAAGG CGTTCACTCC AAGAAGACGA CCCGTGTTGA GAGCGGCGAC
CTGACTCTCG TCTCGAAAGT CTCTCTCCAA GTACCGTCCG ACGACGAGGA TGCTGAGCAA
AAGAAAAAGC AAGATCAGAA AGAGTTAGCA CAGCTCAACG ATGCGGATGC TCTGGACCGT
ACCCAACAGG ATGCTGCTGG TGAGGGAGTA GCTTCGGTCG CCACGCCCGA GTCTGCAGGT
GGCACGACCG GCTACACATT CCACGCCTCC ACCGATGAGG TGGCGGTCCT CTTTGCTGCG
ACCGACCACG GAAGGGCCGT GCCTGATCTC GACGTGAAAG ACATCAAGCT GTTGGATGGC
CGCCACGCTC CTGCGCTGGT CACCGGCTTC CGCAATGAGG CTCAGCTCCC CTTACGCATT
GGATTGGTGA TTGATACCAG TGCTTCCATC GCGGGCCGAT TCAAGTTCGA GCAGGACGCG
GCTGGCGAAT TTCTTCAGAG AGTCCTTACT GGCCCCGAAG ATCTCGGTTT CGTTGTCGGA
TTCTCGAACT CCATTCTCAT GGCGCAGGAC TTTACGCACG ATTCGAAGCA AATTGCCCAC
AGCATTCAGG CCTTTGCTCC CTCCGGTGGT ACAGCGCTTT GGGATGCAGT GAATTTCGCG
GCGGAGAAAC TGGCTAGCCA TCCGGAGAGG CAGCCGGTGG CGAAGATCCT TATTGTCATC
AGTGATGGAG AAGACAACTC AAGCGCCACC ACGGCAAAAC AGGCGATCCA ACGCGCCCAG
AGTGAAGAGG TGGCGGTTTA CGCAATCAAC ACGCTTGAAA TTACGCAACG TTCGGAGGAG
CCTCCGGTCG GCGTGCGCGC TCTGAAAACA CTGGCGGAGA TGACCGGCGG CGCAGCCTTC
ACTCCCGGAT CAGTGCGGTG GCTCAACAGC AGCTTGAACG ATCTCCAGCA AGTCATCCGT
AGTCGATATC TCATCACATA CAAGCCTTCA GGATTTAAAC GGGACGGCAG CTATCGCCGG
GTGCAAGTAG CGGCAGAGAA AGATGGACGT AAACTGCATG TGGTCTCGCG CAGCGGCTAC
TACGCGACAG AGAAGCCCGC GAATTGA
 
Protein sequence
MSRSVSILYS LLVLSGWACA QVLSFPASNP AAITTGAGLN LVQATVMDSN IALIDSLNHS 
PLENPTIALS KLDLKAPGKA RQEYEKGYQA LAKKDFTQAL GHLEKATAIY PSYVSAFNAL
GAAHLGLGQS DEARAAFAEA ISLDDHLPNS YLNMGCAELA LKDYAGAERD ITQASSMAPL
DFQVKAALAY SQYMNNNYQA VVATADDVHA RKHSGAALVH FYAAAAWDAQ GNPAYAQREL
RLLMKEDPKS PAAIQAKSLM QQLQDEGVHS KKTTRVESGD LTLVSKVSLQ VPSDDEDAEQ
KKKQDQKELA QLNDADALDR TQQDAAGEGV ASVATPESAG GTTGYTFHAS TDEVAVLFAA
TDHGRAVPDL DVKDIKLLDG RHAPALVTGF RNEAQLPLRI GLVIDTSASI AGRFKFEQDA
AGEFLQRVLT GPEDLGFVVG FSNSILMAQD FTHDSKQIAH SIQAFAPSGG TALWDAVNFA
AEKLASHPER QPVAKILIVI SDGEDNSSAT TAKQAIQRAQ SEEVAVYAIN TLEITQRSEE
PPVGVRALKT LAEMTGGAAF TPGSVRWLNS SLNDLQQVIR SRYLITYKPS GFKRDGSYRR
VQVAAEKDGR KLHVVSRSGY YATEKPAN
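
The gene and protein sequences can be related through translation table 11 (the bacterial, archaeal and plant plastid code). The following is a minimal sketch, not the database's own method: it builds the codon table by hand, treats an alternative start codon such as GTG as methionine when it opens the CDS, and stops at the first stop codon. The function name `translate_cds` and its `starts_at_initiation` parameter are hypothetical. Only the first 30 and last 27 bases of the gene sequence are used here to keep the example short.

```python
from itertools import product

# Codon-to-amino-acid assignments in TCAG order. Table 11 uses the same
# assignments as the standard code; it differs in its permitted start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons permitted by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna, starts_at_initiation=True):
    """Translate an in-frame DNA string, ignoring whitespace between blocks."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        # An initiating codon is read as methionine regardless of its
        # usual amino acid (GTG normally encodes valine).
        if i == 0 and starts_at_initiation and codon in START_CODONS:
            aa = "M"
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 27 bases of the gene sequence in this record.
first_30 = "GTGTCGCGAA GCGTTTCCAT TTTATATTCG"
last_27 = "TACGCGACAG AGAAGCCCGC GAATTGA"

print(translate_cds(first_30))
print(translate_cds(last_27, starts_at_initiation=False))
```

The two fragments translate to the first ten residues (MSRSVSILYS) and the last eight residues (YATEKPAN) of the protein sequence shown above, with the GTG start codon read as M and the final TGA read as stop.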