Gene Gura_2580 details


Gene Information

Locus tag: Gura_2580
Symbol:
ID: 5163616
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacter uraniireducens Rf4
Kingdom: Bacteria
Replicon accession: NC_009483
Strand:
Start bp: 2984560
End bp: 2986122
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 54%
IMG OID: 640550077
Product: lipopolysaccharide biosynthesis protein
Protein accession: YP_001231331
Protein GI: 148264625
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3206] Uncharacterized protein involved in exopolysaccharide biosynthesis
TIGRFAM ID: [TIGR03007] polysaccharide chain length determinant protein, PEP-CTERM locus subfamily
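
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch of that arithmetic, not part of the record itself. It assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the record states neither, and the variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 2984560
end_bp = 2986122

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1563 520
```

Under these assumptions the result agrees with the listed values of 1563 bp and 520 aa.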


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAACAG GCAATATGAC GGTACACGAT CTTATGGATA TTCTGAAGCG GAGGAAGTGG 
AGCTTGCTCC TGCCGGCGGC AGCGCTCTTT CTTGTTGCAG TGAGCGTGGC ATTTATCCTG
CCGCCGATCT ATCGCTCCAC CACGACCATT CTGATCGAGG AGCAGGAGAT CCCTCCCGAG
ATGGTTGCCA CCACGGTGAC GAGCTTTGCC GAACAGCGGC TGCAGGTGCT CAACCAGCGC
ATCATGAGTT CCACGAGGCT TTTGGAGATC ATCAATCGTT TCAATCTCTA CGCGGACATC
AAGGACAAAA TAACGACGGA AGAGATGATC GAGAAGATGC GCAAGGACAT CAAGTTCGAT
ACGATCAGCG CCGATGTCAT AGATCGCCGT ACCGGCCGGG CGACTCAGGC CACCATCGCT
TTTTCCCTGT CTTACTCAGC CAGGAACCCT GCGACTGCCC AGCAGATCGC CAACGTCCTG
GCTTCCCTTT ATCTGGAAGA GAACCTGAAG GTGCGTGAGC AATCCACGTC GGGGACTTCG
AAGTTTCTTG AGGACGAGAT GAAGGACGTG CAGGCGAAGC TGGTCGGGTT TGAAGCGCAG
ATTTCCGCTT ATAAGCAGCG AAACCTGAAT TCCCTGCCGG AACTGGTTCA AACCAACCTG
TCGGAGCTGG ACCAGGTGGA GCGTAGCATT ATCCAGTTCA ATGACCAGTT GCGCACCCTG
AAGGAGAAGG AAGGTTACCT GCGGAGCCAG CTTGCGAACA TCACGCCCGA AGACGAGAAT
CAGGACAAGA CCCGCCTCAA TGATCTGAAA GCGAAACTGG TGAACCTGAA GAGCCGCTTC
TCCGATGAGT ACCCCGATGT AAAAAAACTT CAGCAGGAGA TTGCGACTCT GGAAAAGCAG
CTCCACACAG TCGGCGGAGA TGTAAAGTCT ATCCGTGCCG ATAATCCGAA CTATATTAAT
CTGGCTTCCC AACTGGCCGC CGCCCAGTCG GAAATCGACT CGGTGAAACG CCAGCTTGCA
CAGTTTCACG ACAAGCGTGA TTCTTACCGC AAACGGATTC AGGCTGCGCC GAAGGTTGAG
GAAGGGTTTA AAAACCTGAT GGTCGAGCGA AACAACATGC AGTTGAAATA CGATGATCTT
TCGAAAAAAT TTCTGGAAGC CAAGGTCGCC CACGGCCTGG AGAAAGAGCA GATGGGCGAA
CGGTTCACCA TCGTCGATGC GGCCAGGCTA CCTGAAAAGC CGGTGAGTCC CAATGTGCCG
GTTATCATGC TGATCGGCCT GATTCTCGGG ATCGGCAGCG GGGTAGGCGT TGCGACCATT
CGCGAAACCG GCGACAAATC AGTGCACAGC ATGGAGGTCT TGGCCAAGGC AACCATGTAT
CCCGTGCTTG CCGCCATTCC TGAAATCGTC ACCTGGCAGG ATCAGCAACG GCAGCTGAGA
AGACGCAGAT CGCTTCTTGT TGCGGGCATA ATGATCATTC CCATTTCCCT GCTGGCAATT
CATTTTCTGG TCATGGACCT GAGTGTGGCC TGGGCCATTT TCAAGCGCAG AATGGCTCTT
TGA
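
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be normalized before any computation such as GC content. The following is a minimal sketch of one way to do that; the helper names `clean_sequence` and `gc_percent` are hypothetical and are not part of any database tool.

```python
def clean_sequence(blocked):
    """Remove all whitespace from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_percent(blocked):
    """Return the percentage of G and C bases in the cleaned sequence."""
    seq = clean_sequence(blocked)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGACAACAG GCAATATGAC GGTACACGAT CTTATGGATA TTCTGAAGCG GAGGAAGTGG"
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content listed in the record.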
 
Protein sequence
MTTGNMTVHD LMDILKRRKW SLLLPAAALF LVAVSVAFIL PPIYRSTTTI LIEEQEIPPE 
MVATTVTSFA EQRLQVLNQR IMSSTRLLEI INRFNLYADI KDKITTEEMI EKMRKDIKFD
TISADVIDRR TGRATQATIA FSLSYSARNP ATAQQIANVL ASLYLEENLK VREQSTSGTS
KFLEDEMKDV QAKLVGFEAQ ISAYKQRNLN SLPELVQTNL SELDQVERSI IQFNDQLRTL
KEKEGYLRSQ LANITPEDEN QDKTRLNDLK AKLVNLKSRF SDEYPDVKKL QQEIATLEKQ
LHTVGGDVKS IRADNPNYIN LASQLAAAQS EIDSVKRQLA QFHDKRDSYR KRIQAAPKVE
EGFKNLMVER NNMQLKYDDL SKKFLEAKVA HGLEKEQMGE RFTIVDAARL PEKPVSPNVP
VIMLIGLILG IGSGVGVATI RETGDKSVHS MEVLAKATMY PVLAAIPEIV TWQDQQRQLR
RRRSLLVAGI MIIPISLLAI HFLVMDLSVA WAIFKRRMAL