Gene Gura_3214 details

Gene Information

Locus tag: Gura_3214
Symbol:
ID: 5167039
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 3776727
End bp: 3777767
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 57%
IMG OID: 640550699
Product: lipopolysaccharide heptosyltransferase I
Protein accession: YP_001231948
Protein GI: 148265242
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0859] ADP-heptose:LPS heptosyltransferase
TIGRFAM ID: [TIGR02193] lipopolysaccharide heptosyltransferase I
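
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 3776727
end_bp = 3777767
gene_length_bp = 1041
protein_length_aa = 346


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate interval (assumed convention)."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)       # True
print(coding_codons(gene_length_bp) == protein_length_aa)    # True
```

Under these assumptions the reported 1041 bp and 346 aa agree with the coordinates: 347 codons in total, one of which is the stop.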


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGGTCC TGATCGTTAA AATGAGCGCC ATGGGTGATA TAATCCATGC CTTGCCGGTA 
TTGGATTACC TGCACAAGGT TTCACCGGGG ATCGAGATCG ACTGGCTGGT CGAGGAGCCT
TTCCTGGACG TTGTGGCCGG AAATCCGCTG ATCAGCACGA TCCATACGGC ACGCACCAAG
GTCTGGCGCA AGCGGCCTTT CGCCGTCAGT ACATTTCGTG AGATCGGCGC ATTGAAGCAG
GCGTTGCAGG AGAGGGCATT CGATGTCGTT TTCGATATCC AGGGGAACAT CAAGAGCGGC
GTCTTCGGCT GGCTGAGCGG TGTCGACAAC CGGATCGGCT TCAATGCCGA CGTCCTCCAG
GAACGGCTGA ACATGATGTT CACGACGCGC CAGATACCGT TGCGGCCCCA TGATTACCAT
ATCACCGACC AGTACCTGCG GCTTGTCAGC GTCCCCTTCG GTCGCGATTT CAGGAAAATG
CAGCTTTCAT CCGACATTTA CACGACGCCG GAGGACGACG CTGTCGCCGA GACACTTCTC
TCCACCCTGG CCGACGGGCT GGTATTCCTG TTCCATTACG GGACCACCTG GCAGACCAAG
TTCTGGAGCG AGAAGAGTTG GATCGAGCTG GGGAAGGCTT TGCTGGATAG GTTTTCGGAA
TCGTCAATCC TCCTTTCCTG GGGTAATGAT ACGGAGCGCC GTGTAGTAAT GGGTATTGCC
GCGGGTATCG GGCCCGGCGC CAGGGTGATT GAACGGTATT CGTTGAAAGG CTTGACCGCC
CTGCTGAAGA AGGTCGATCT GGTGGTGGGA GGTGACACGG GCCCTGTGCA TCTGGCCGCA
GCGGTCGGCA CGTCGACCGT TTCCTTTTAC CGCTCTTCCG ACGGCAAAAG GAGCGGTCCG
CGAGGCGATG GTCATGTGAT TGTCCAGTCC CCACTGCTGT GCGCCAGGTG TTTCAGGACA
CGATGTGACA AGGATGAGGA ATGCCGGCAA AGCATAACGG TGGAAGCCGT TGTATGTGGT
GTTGAAAAAC TTTTAACCTA G
 
Protein sequence
MRVLIVKMSA MGDIIHALPV LDYLHKVSPG IEIDWLVEEP FLDVVAGNPL ISTIHTARTK 
VWRKRPFAVS TFREIGALKQ ALQERAFDVV FDIQGNIKSG VFGWLSGVDN RIGFNADVLQ
ERLNMMFTTR QIPLRPHDYH ITDQYLRLVS VPFGRDFRKM QLSSDIYTTP EDDAVAETLL
STLADGLVFL FHYGTTWQTK FWSEKSWIEL GKALLDRFSE SSILLSWGND TERRVVMGIA
AGIGPGARVI ERYSLKGLTA LLKKVDLVVG GDTGPVHLAA AVGTSTVSFY RSSDGKRSGP
RGDGHVIVQS PLLCARCFRT RCDKDEECRQ SITVEAVVCG VEKLLT