Gene Acid345_2506 details


Gene Information

Locus tag: Acid345_2506
Symbol:
ID: 4069875
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Candidatus Koribacter versatilis Ellin345
Kingdom: Bacteria
Replicon accession: NC_008009
Strand:
Start bp: 2962825
End bp: 2963913
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 56%
IMG OID: 637984523
Product: von Willebrand factor, type A
Protein accession: YP_591581
Protein GI: 94969533
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG1240] Mg-chelatase subunit ChlD
TIGRFAM ID: [TIGR03436] VWFA-related Acidobacterial domain
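
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming one trailing stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(2962825, 2963913)
print(length_bp)                  # 1089
print(protein_length(length_bp))  # 362
```

Both results agree with the Gene Length (1089 bp) and Protein Length (362 aa) fields.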


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.179837
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCAGCA GGAATTGTAT GGTGCACGTG CCAGTTGCCG CGTGGATCCT GCAAGTCTCG 
CGCGCCGGAG TGTCGGCGCC ATCTCGCTTT CGCCTCACCC TTCTCGGATG GATGATGTTA
CTGGCGGTGA TCGTCGGTTT TGCAGTTCCC GGGCTCGCGC AGGTGGACAG TAACGAGGTC
CACGTGCAGC CTCGCGAGGC GCCAAAGCCT CCTACCCCGC CGCAAGGCGA TCCTGCTGAC
GTGAATACCC ATACCCGTCC GATGCGGGTG GATGTGAATA TCGTGCTGGT GCCGGTAACC
GTGACCGATC CGGACAACCG GCTGGTAACC GGGCTTGAGA AAGAGAATTT CGAAGTTCTG
GACCAGAACA TTCCGCAACA GATCCGGCAC TTCTCGAGCG AAGACGCTCC GGTGTCCATC
GGCGTGATTT TCGACATGAG CGGGTCGATG TCGAACAAGA TTGATAAATC GCGCGAGGCG
ATTGTCGAGT TCTTTAAGAC CGCCAATCCG GATGACGAGT TCTTTGTGGT CGCCTTCAAC
GACAAGCCGG AAGTGTTGCA GGACTTCACC AACAGGATTG AGGATATCCA GGAGAAGTTG
ACGATTCTTC AGCCGAAAGA CCGGACGTCG CTGCTCGATG CCATTTACCT GGGCATGAAC
AAGATGCGGC AGGCGAAGTA CGAACGGAAG GCGCTGCTGA TCATCTCCGA TGGCGGCGAC
AACCATAGCC GGTATACGGA AAACGAGATT AAAAGCATGG TGCGCGAGGC CGATGTGCAG
ATTTATGCCA TCGGAATTTA TGACCTGGCG CCGACGACAA CGGAAGAGAT GGCGGGGCCA
GCACTGCTCG GGGAAATCTC TGATTGGACC GGCGGACGTA TGTTCCCGAT TGATAACGTC
AATGAATTGG CGGACGTAGC CACAAAGATA GGAGTAGAGC TGCGCAACCA ATATGTACTC
GGATACCGTC CAAGTAAACC AGCGAAGGAT GGCAAATGGC GGAAAATCAA GGTCCGTCTG
AACCCGCCTA AGGGCTTGCC TCCGCTCCAT GTTTTCGCGA AGACTGGTTA CTATGCACCT
TCGGAATAG
 
Protein sequence
MGSRNCMVHV PVAAWILQVS RAGVSAPSRF RLTLLGWMML LAVIVGFAVP GLAQVDSNEV 
HVQPREAPKP PTPPQGDPAD VNTHTRPMRV DVNIVLVPVT VTDPDNRLVT GLEKENFEVL
DQNIPQQIRH FSSEDAPVSI GVIFDMSGSM SNKIDKSREA IVEFFKTANP DDEFFVVAFN
DKPEVLQDFT NRIEDIQEKL TILQPKDRTS LLDAIYLGMN KMRQAKYERK ALLIISDGGD
NHSRYTENEI KSMVREADVQ IYAIGIYDLA PTTTEEMAGP ALLGEISDWT GGRMFPIDNV
NELADVATKI GVELRNQYVL GYRPSKPAKD GKWRKIKVRL NPPKGLPPLH VFAKTGYYAP
SE
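
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the code below shows one way to join such blocks into a contiguous string, compute the GC fraction, and translate a coding sequence. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares; table 11 differs in which codons may serve as start codons, and this sketch does not model that.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks, and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 and last 9 bases of the gene sequence above.
head = clean_sequence("ATGGGCAGCA GGAATTGT")
tail = clean_sequence("TCGGAATAG")
print(translate(head))  # MGSRNC
print(translate(tail))  # SE
```

The two translated fragments match the first six and last two residues of the protein sequence listed above. Applying `gc_fraction` to the full cleaned gene sequence gives a value to compare against the reported GC content of 56%.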