Gene Acry_0472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_0472 
Symbol 
ID5161594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp525547 
End bp526758 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content72% 
IMG OID640552388 
ProductVWA containing CoxE family protein 
Protein accessionYP_001233615 
Protein GI148259488 
COG category[R] General function prediction only 
COG ID[COG3552] Protein containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.180421 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGCCT CCGATACGGG ACGGCTCGCG CCGAATGTGA TGCATTTCGC GCGCCTGCTC 
CGGCGTGCCG GCCTGCCTGT CGGCCCCGGC GAGGTCATCG CCGCCGCCGA GGCGCTCACC
CATGTCGACA TCACCGACCG GCGGGTGATG CGAGCGGCGC TGCAGGCGGT GATGCTGCGC
CGGCACGAGC ACGCGCTGCT GTTCGACCAG GCCTTCTCGG TCTTCTGGCG CAACCCCGAC
GCCGCCAAAT TCGCCCAGAT GCTGGCAGCG ATGGATGGCC GCGCCCCGCG CGAGGAAAAG
GCCGCCGCCG GCGCCCGCCG CCTTGCCGAG GCGATGCAGG CGGCCAAGTC GCGCGAGCAG
GAACCCCGCC CCGACGAACG CCGCGAGGTC GACGCGCTGC TCTCCGCCTC TGGCCAGGAA
CGCCTCGAAT CCCTCGATTT CGAGGCGATG AGCGCCGAGG AGATCGCCGC CGCCAAGGCC
GAGATCGCCC GCCTCACCCT CCCGCTCGAC GAACGGCGCA CCCGCCGCTT CCGCCTCGCC
GCCCGCGGCT GCCGGGTCGA TCTCAAGCGC ACCCTGCGCG ATTCGATGCG CCATTCCGGC
GAGGTGTTCG ACATCGCCCG CCGCGTGCCG CTCACCCGCC CGCCGCCGCT CGTCGTCCTG
TGCGACATTT CCGGCTCGAT GGCCCGCTAC GCGCAGATCC TGCTGCACTT CCTCCATGCC
GTCGCCAACG AGCGCGACCG CGTGACCACC TTCCTCTTCG GCACAAGGCT GACCAACATC
TCCCGCCAGC TCGCCCGGCG CGACCCGGAA GAGGCGTTCG AGCAGGTTGC CGGCGCGGTG
CCGGACTGGT CGGGCGGCAC CCGCATCGGC GAGGCGCTCG GCCAGTTCAA CCGGCTCTGG
GCCCGCCGCG TGCTGGCGCA GGGCGCGGTC GTCCTTCTCG TCACCGACGG GCTCGACCGC
GAGGGCGCCG TCGGCCTCGC CGACAACATG GCAAGGCTGC ACCGCTCCTC GCGCCGGCTG
ATCTGGCTCA ACCCGCTCTT GCGCTACGAT GGCTTCGCGC CGAAATCGCA AGGCGCACGG
GCGATGCTGC CCTATGTGGA CGAGTTCCGC CCGGTGCATA ACCTGGCCAG CCTGCGCAGC
CTGGTCCAAG CCCTGTCGGG CGAGGCGCCG CCGCGCCTGC AGGCCGCCGC CCTGTGGGAG
ACCCGCCAAT GA
 
Protein sequence
MSASDTGRLA PNVMHFARLL RRAGLPVGPG EVIAAAEALT HVDITDRRVM RAALQAVMLR 
RHEHALLFDQ AFSVFWRNPD AAKFAQMLAA MDGRAPREEK AAAGARRLAE AMQAAKSREQ
EPRPDERREV DALLSASGQE RLESLDFEAM SAEEIAAAKA EIARLTLPLD ERRTRRFRLA
ARGCRVDLKR TLRDSMRHSG EVFDIARRVP LTRPPPLVVL CDISGSMARY AQILLHFLHA
VANERDRVTT FLFGTRLTNI SRQLARRDPE EAFEQVAGAV PDWSGGTRIG EALGQFNRLW
ARRVLAQGAV VLLVTDGLDR EGAVGLADNM ARLHRSSRRL IWLNPLLRYD GFAPKSQGAR
AMLPYVDEFR PVHNLASLRS LVQALSGEAP PRLQAAALWE TRQ