Gene Acid345_3993 details

Gene Information

Locus tag: Acid345_3993
Symbol:
ID: 4071129
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 4722858
End bp: 4724033
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 60%
IMG OID: 637986020
Product: von Willebrand factor, type A
Protein accession: YP_593067
Protein GI: 94971019
COG category:
COG ID:
TIGRFAM ID: [TIGR03436] VWFA-related Acidobacterial domain
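
The length fields in this record agree with its coordinates, which can be checked with simple arithmetic. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive coordinates and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(4722858, 4724033)
print(length)                     # 1176
print(protein_length_aa(length))  # 391
```

Both results match the Gene Length (1176 bp) and Protein Length (391 aa) values listed in the record.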


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.889697
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCAACG TCCGCATCCT TCTCTGCCCC TTGGTGGGCT TTTGCCTGCT TGCCTCCATC 
GCGACGGCAC AGGACAAACC GACGCTACGC ACTTCGTCTC CACCGGCAGA AGAGCGCCAG
CCAGAAGCAC AGCCTGACAC CCCGTCGTTT CACGTTGAGG TAAAGGAAGT CACCCTGCCG
GTGACGGTGC GCGACAAGCA CGGCAAGATC GTCCAGACGC TCAATAAAGA AGACTTCAGC
CTGGTGCAGG ACGGTAAGAC GCAGACGATC ACCCAGTTCC GTCGCGACAC CAAACTGCCG
CTGACGCTTG GCCTTTTAGT GGATACGAGC TACAGCGTGC GCGACGAGTT GCCGGCCGAA
AAAACGGCAA GCGAGAAGTT TCTCGACGAT ATGCTGGCGC AACCGAAAGA CCAGGCGTTC
CTTATCCACT TCGACCGCGA AGTGGAGTTG ATGACGGACC TGACCTCGTC GAAAGACAAG
CTGCACAGGG GCATTGGCGA GTTGGAGACT TCCGGCCCTC CATCGCAAAG CAGTAGCGAT
GACGGCCAAC GCCATCGGCG GGGAGGAACG CAGCTCTACG ACGCCATCTA CCTGGCGGCA
TCCGAGATCT TGCAGAAGCA GCAAGGGCGG AAGGCAATCG TCGTTCTTAC TGATGGCGAA
GATCGCGGCA GCAAGGAAAC TCTGACCGAC GCAGTGGAGG CAGCACAGCG CGCGGACGCG
ATTGTCTATG CGATTTACTT CAAGGGAGAG CAGGAGCAAA GCCGGTGGGG CAACGGAGAT
CACGGTAACC GTGGCGGCAT GGGTGGCCCG CGGATCGGCT ACCCAGGTGG AGGTGGCGGT
TATCCGGGTG GTGGCGGCGG ACGATACCCC GGCGGCGGTG GTGGTCGCGG TGGCGAGCAA
CGCGAGGCTC GGTTAGATGG GAAGAAGATC CTGACCGAAA TCGCGAGCAA GACCGGCGGA
CGGATGTTCG AAGCCAGCAA GAAGGAGAAC GTCGAAGCAA TCTATGCGCA GATCGCCGAA
GAACTGCGTA GCCAGTACGT GTTGGCGTAC ACGCCGGACC ATTCCAGTGC CGATGCCGGC
TATCACCGTG TAACCGTTGC GGCGAAGGAC AAAGAACTGA AGATCCAAAC CCGCGAAGGG
TTCTACATCC CCGAACAAAC CACGGCAACG AAGTAG
 
Protein sequence
MSNVRILLCP LVGFCLLASI ATAQDKPTLR TSSPPAEERQ PEAQPDTPSF HVEVKEVTLP 
VTVRDKHGKI VQTLNKEDFS LVQDGKTQTI TQFRRDTKLP LTLGLLVDTS YSVRDELPAE
KTASEKFLDD MLAQPKDQAF LIHFDREVEL MTDLTSSKDK LHRGIGELET SGPPSQSSSD
DGQRHRRGGT QLYDAIYLAA SEILQKQQGR KAIVVLTDGE DRGSKETLTD AVEAAQRADA
IVYAIYFKGE QEQSRWGNGD HGNRGGMGGP RIGYPGGGGG YPGGGGGRYP GGGGGRGGEQ
REARLDGKKI LTEIASKTGG RMFEASKKEN VEAIYAQIAE ELRSQYVLAY TPDHSSADAG
YHRVTVAAKD KELKIQTREG FYIPEQTTAT K
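
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any length or composition check. The following is a minimal sketch in Python of how the gene sequence could be cleaned and its GC fraction computed; the helper names are hypothetical, and the sample input is only the first two blocks of the gene sequence rather than the full record.

```python
def clean_sequence(block_text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(block_text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already-cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence, used as a small sample.
sample = clean_sequence("ATGAGCAACG TCCGCATCCT")
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.55
```

Pasting the full gene sequence into `clean_sequence` would allow the same check against the Gene Length and GC content fields of the record.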