Gene Acry_3372 details

Gene Information

Locus tag: Acry_3372
Symbol:
ID: 5159198
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidiphilium cryptum JF-5
Kingdom: Bacteria
Replicon accession: NC_009468
Strand:
Start bp: 88134
End bp: 89282
Gene Length: 1149 bp
Protein Length: 382 aa
Translation table: 11
GC content: 61%
IMG OID: 640538692
Product: HipA domain-containing protein
Protein accession: YP_001220125
Protein GI: 148243886
COG category: [R] General function prediction only
COG ID: [COG3550] Uncharacterized protein related to capsule biosynthesis enzymes
TIGRFAM ID: [TIGR03071] HipA N-terminal domain
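
The Start bp, End bp, Gene Length and Protein Length fields above are related by simple arithmetic. The sketch below is a minimal consistency check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record for Acry_3372.
length = gene_length_bp(88134, 89282)
print(length)                     # 1149
print(protein_length_aa(length))  # 382
```

Both results agree with the Gene Length (1149 bp) and Protein Length (382 aa) fields of the record.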


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCGACC ACACGCCGGT TTTCTATGAG ACCCTGCTGG TTGGTATGAT CCATACGGAC 
ACCAGGGGCT CTTGCTTCAC CTATGACGAG AGCTGGCTGT CACGGACAGG CAGCTTCCAG
ATCTCACTCA CCATGCCGCT TGGCCGCCCA GCGGTGGAGC ACCATGTCAT CATGCCATGG
CTGGCAAACC TGCTGCCAGA AGGCGATGCC ATCAGTACGA TCGCGAGGCG GTCCGGAATA
GCCACCGGCG ATATTCTCAG CCTCCTCATG GTCGTCGGCA GGGACACCGC GGGCGCGTTG
AGCATTGGTC AGCCTCGCAG CCGCGAGGGG CGCCATTATA TGACGATCGC CGGTCAGGAC
GCGCTGGAGC GGCTCATCGA AGATTTACCG CGCCGGCCGT TGCTGTCGGG AGATGACGGC
GTCTCGATGA GCCTGCCCGG CGCGCAGGAG AAATTGCCCG TCGTTCTCAA TGACAACGAT
ATTGCGCTTC CGCTAAACGG GGCGCCGTCA ACCCACATCA TCAAGCCCAA CAACCGAAGA
CTGCCAGGCA GCGTTCAGAA CGAGGCACTC TGCATGGTTC TGGCACGACG GGTTGGGCTC
GATGTGGCCG ACGTCACCAC CGGCCAGGCC GGCAAGCGCT CCTATCTTCT GGTTGAGCGC
TATGACCGGA TCCAGCGTGG CGGCGTGTGG CGCCGGCTAC ACCAGGAAGA TTTCTGCCAG
GCACTGTCGC TACCTCCGGC GTCGAAGTAC CAGCACAACA GAACGGGTAT CCTTGGGCCA
GGACTGGCCG ACCTCTTCCG GACCGTCAGG ACTTTCATGA CGGCGCGCGA TACGATCAGG
CTTCTCGATG CGGTTATCTT CAACGTGCTG ATCACGAACG TCGATTCCCA TGCGAAGAAC
TATTCGATCA TGTTGACTGG GCGTGCCCGG CTCTCGCCAC TTTACGATCT GATGGCCGGC
GATGCGTGGT CCGAGGTTAC CCAGAATCTC CCTCAGGACA TCGGCGGCAA GAACCGCGGC
CAATACATCA ATCATTTGCA CTGGCGTCGG ATGGCGGAGG AATCAGGTCT CAGCGCCGGC
GCCGTTGTCC GGCGGGTGAT CCAGATGGCA ACGGCCCTTC CATCCATGCT CGATCAGGCG
GTTGATTAG
 
Protein sequence
MSDHTPVFYE TLLVGMIHTD TRGSCFTYDE SWLSRTGSFQ ISLTMPLGRP AVEHHVIMPW 
LANLLPEGDA ISTIARRSGI ATGDILSLLM VVGRDTAGAL SIGQPRSREG RHYMTIAGQD
ALERLIEDLP RRPLLSGDDG VSMSLPGAQE KLPVVLNDND IALPLNGAPS THIIKPNNRR
LPGSVQNEAL CMVLARRVGL DVADVTTGQA GKRSYLLVER YDRIQRGGVW RRLHQEDFCQ
ALSLPPASKY QHNRTGILGP GLADLFRTVR TFMTARDTIR LLDAVIFNVL ITNVDSHAKN
YSIMLTGRAR LSPLYDLMAG DAWSEVTQNL PQDIGGKNRG QYINHLHWRR MAEESGLSAG
AVVRRVIQMA TALPSMLDQA VD
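
The gene and protein sequences above can be compared programmatically. The following is a rough sketch rather than the database's own method: it strips the 10-base grouping, computes GC content, and translates codons using the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which this sketch does not model). All function names are hypothetical.

```python
from itertools import product

# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 18 bases of the gene sequence, as grouped in the record.
print(translate("ATGAGCGACC ACACGCCG"))  # MSDHTP
```

Passing the full gene sequence to `translate` and the full protein sequence to `clean` yields two strings that can be compared directly, and `gc_content` on the gene sequence can be checked against the GC content field.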