Gene Acid345_1366 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_1366 
Symbol 
ID4068842 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp1657456 
End bp1658931 
Gene Length1476 bp 
Protein Length491 aa 
Translation table11 
GC content60% 
IMG OID637983375 
Productvon Willebrand factor, type A 
Protein accessionYP_590442 
Protein GI94968394 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.808281 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACAG TTTCGAAATT GCGCAACTCT GTATTGGTTT TTGCCGCGTG CATGACTGCA 
TCCGGAGTCT CCCTCGCGCA GAACGAACTC GATCTCTATA CCACTGCCGT GCGCCAAAGC
CGCATCTCCG ACCGGAGTGC GTGGATGGCG CGGTTTCTAA AGGAACATCC GCAGAGTGAT
TTGCGCGAGG ATGCGCTTGA AGTCCTGGTA TGGGACGCGA TGGAAAGCGG CCAGCGTGAC
CAGTCGCGCC AATATGCGCA AGAGTTGCGG CAGATTGATC CGCACAATGC GCTGGCGATG
GCGGTCGTGG CTGAAACGCG CGCAGAGACC ACCGGTCGCG CCGACAAGAA AGCGGCGGCA
CAGGCTTTTG AAATAGCGAA GGCCGGGATC CAGGTGTATC CGCAAATGCA TCGGCCGGAA
GGCATGCGTG AGGGCGAGTT CATCCTGCTA CAGCGGCAGG TGGTCGCGGT GCTCGATGGC
GAAGCAGGCC TTGGATATCT CGCGGATAAA GATTACGAAG CGGCGCGCCG ATATCTGCAC
GAGGCGGTTG CCATTCGTCC ACAGGACCCG CGTTATCTTT ACGGACTTTC GTTGGCGCTG
CTCGACGGGA AAGACGCGAT GCAGCAGGAG GGCTATCTCT ATCTCGCGCG CACAGTGAAC
CTGACGCAAG GAACGCCGGC GGGACAGCAG ATCGCAAATT ACGCGCAGAA ACGCTTCGAA
AAACAGGGCG GTACGACTGC ATCGTGGAAC GAGTATCTTG CCGCGGCGAC GACGCCGGGC
ATGCCGCGAC GTGCGCCAGC GACGCAGCCT GAGGCCCCAA TAGTCGCGAA GAACGTGCCG
CCAACGCGGC CGGGAGTGCA GCCGCAATCC GAACCGCGTG AGACGAACCC TGAGGAGATT
CCGCAGCCGA CGTTTAAGCG TGAGTATGTG GCGCGCACGT CTCCCGTCTC GATGGGCATT
TTGATTCAGA CAGAACACCT GACGAAGGAG AACCGCCGAC AGATCCTCGA CGCGCTTACC
GACATGATCC GGCACCTGCG CAACGACGAT GAAGTGTTCA TCATGGCGTA TGGCAAGAGC
CTGCAATTCG AGCAGGACCT CACCGGTAAT CCGAAGCTTC TGGAAGAAGC GATGGAGCAG
ATCAAAGCGG AGAGCGGCAC TGCCCTGCTC GATGCCGTGG GCTTTGCTGC GGGACACCTG
GAGCGCATTG CGACCAACAA GAACCGGCTG TTGCTGGTGA TTTCGGATGG GCGGAATACG
CCGTCGAAGG ACAATCCGCT TACGCTCTCG CAGAAACTGA ATACGGTGCG GGTGGATTGC
ATTGGGCTTG ATGTGGATGG CGATTCGGGG CGGCGTCAGT TGGAGTCGCT GGCGGCGTAT
TCAGGCGGGC AGGTGAGTTT TGCGAGCGAC ACGCGGCAAC TGCGCACGGC GGCGGTGCAG
ATGGCGGAAG CGATTGGGAT TGAGTTTCCG GATTAG
 
Protein sequence
MSTVSKLRNS VLVFAACMTA SGVSLAQNEL DLYTTAVRQS RISDRSAWMA RFLKEHPQSD 
LREDALEVLV WDAMESGQRD QSRQYAQELR QIDPHNALAM AVVAETRAET TGRADKKAAA
QAFEIAKAGI QVYPQMHRPE GMREGEFILL QRQVVAVLDG EAGLGYLADK DYEAARRYLH
EAVAIRPQDP RYLYGLSLAL LDGKDAMQQE GYLYLARTVN LTQGTPAGQQ IANYAQKRFE
KQGGTTASWN EYLAAATTPG MPRRAPATQP EAPIVAKNVP PTRPGVQPQS EPRETNPEEI
PQPTFKREYV ARTSPVSMGI LIQTEHLTKE NRRQILDALT DMIRHLRNDD EVFIMAYGKS
LQFEQDLTGN PKLLEEAMEQ IKAESGTALL DAVGFAAGHL ERIATNKNRL LLVISDGRNT
PSKDNPLTLS QKLNTVRVDC IGLDVDGDSG RRQLESLAAY SGGQVSFASD TRQLRTAAVQ
MAEAIGIEFP D