Gene Acry_2964 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcry_2964 
Symbol 
ID5159819 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidiphilium cryptum JF-5 
KingdomBacteria 
Replicon accessionNC_009484 
Strand
Start bp3244178 
End bp3245659 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content68% 
IMG OID640554894 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_001236073 
Protein GI148261946 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis 
TIGRFAM ID[TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACAGC TCATCGAAAC CGCACTGCGG CTGCTCGCCA CCGCCTGGCG CAAGCGCTGG 
TACGCGCTCG CCACCGCGTG GCTCGTCTGC GCGCTCGGGT GGACGGCGGT CATGCTGCTG
CCGCCGAATT TCGAGGCCAG CGCGCAGCTC TACGTCGCGG CCGACCCGGT GCTGACGCCG
CTGCTGCGCG GCATCGCCAT CAACGGCAAT TCCGAACAGG AATTCAACCT GCTCCGCCAG
ACGCTGCTCA GCACCCCCAA TCTCCAGCAC CTGATCGACC GCGAAGGGCT GCACGCGCAG
GGGCCGGGTG CGCGCGAGGC GCTGGTGCGC CGGCTGCGGG CGCGGGTGAC CGTGGTGCCG
CAGAGCCGCA ACCTGTTCAC CATCCGCTAT GTCGGCCACG ATCCGCGGCG CGCCTACAAC
ATCATCGCCG GTCTGGTGAA CATCTATGTC GAGCGCGCGT CGGACCACAA CCAGAGCGAC
ATCGACAATG CCGGCAAGTT CCTGCAATCG CAGATCGACT ATTTCCACAA CCAGCTGAAA
TCGCTCGAAG CGCGCCGCGC GGCGTTCCAG GCGAAATATC TCGAACTGCT GCCGGGCAGC
GACGGCGTTT CCGCCGTGCG GGCATCGGGC GCGCGGGTCC GCAAGCTGGA GACCGAACTG
CAGGACGCCA AGGCCGAGCA GGCGCTGCTG GCCAGCGAAC TCGCCAAGAC CAAGCCGCTG
CTGTCGGAAA CCCAGGCCGC CGGCGGCAAC CCCGCGCTCG CCGCGGCCCT CGCCAACCTG
GCCAGGCTGC GCCAGCAATA CACCAACAGC TATCCCGGCG TGCAGGCGGC CGAACGGCAG
GTCAAGGCGC TCGAACACGG GCCGGCCGGC GGCGGCAAGT CCAGCTACAG CGTGCCGGTC
GCCAATCCGG TCTACAAGGC GCTGCATCTC GAGATCCTGC AGACGCAGAC CAGGATCCTC
GAGACGACCC GCGCGCTGGC GCGCGCCAAG GTGGAGCATG CGAAGCTGAC CGCGCTCGCC
CGTTCGGCGC CCGGCGTCGA GGCGAAGTTC ATCAACCTCA ACCGGAATTA CGGTGTCCTG
CAGAAGGAAT ATCAGGACCT GATCAGCCGG CGCGAGGCGA TGCGCATCGG CGCCGCCGCC
AATATCGATG CCAACCAGGT GCAGCTGCAG GTGATCAATC CGCCGGTTCT GCCCCGGCTT
CCCATCGGGC CGAACCGGCG CCTGTTCCTC GTCGCCGTGC TGGTCCTCGG CATCGGTGCG
GGCCTCGGCG TCGGCGTGTT GCTCGGCGAA CTCGAGGGCC GCGTCCGCTC CGAGGCGGAT
CTGCGCGGCT TCGGCATCCC GGTGATCGGC CAGATTTCCG ACATCTCGCC GCAATCCGGC
GTGATCATGC CGGCGCTGCG CATCGGCATC GGCGGTTCGC TGCTCCTGGG CGTGTTCGGC
GCGCTCTTCA TTGCGACCTT CATCATCGGG GGGCTCGGAT GA
 
Protein sequence
MEQLIETALR LLATAWRKRW YALATAWLVC ALGWTAVMLL PPNFEASAQL YVAADPVLTP 
LLRGIAINGN SEQEFNLLRQ TLLSTPNLQH LIDREGLHAQ GPGAREALVR RLRARVTVVP
QSRNLFTIRY VGHDPRRAYN IIAGLVNIYV ERASDHNQSD IDNAGKFLQS QIDYFHNQLK
SLEARRAAFQ AKYLELLPGS DGVSAVRASG ARVRKLETEL QDAKAEQALL ASELAKTKPL
LSETQAAGGN PALAAALANL ARLRQQYTNS YPGVQAAERQ VKALEHGPAG GGKSSYSVPV
ANPVYKALHL EILQTQTRIL ETTRALARAK VEHAKLTALA RSAPGVEAKF INLNRNYGVL
QKEYQDLISR REAMRIGAAA NIDANQVQLQ VINPPVLPRL PIGPNRRLFL VAVLVLGIGA
GLGVGVLLGE LEGRVRSEAD LRGFGIPVIG QISDISPQSG VIMPALRIGI GGSLLLGVFG
ALFIATFIIG GLG