Gene GSU3020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGSU3020 
Symbol 
ID2686810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sulfurreducens PCA 
KingdomBacteria 
Replicon accessionNC_002939 
Strand
Start bp3314633 
End bp3315748 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content49% 
IMG OID637127713 
Producthexapeptide transferase family protein 
Protein accessionNP_954062 
Protein GI39998111 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1044] UDP-3-O-[3-hydroxymyristoyl] glucosamine N-acyltransferase 
TIGRFAM ID[TIGR03570] sugar O-acyltransferase, sialic acid O-acetyltransferase NeuD family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAAAA CAATGGACAT AATCATTCCC AGAGAATTTG TCAGCGACGA CCAGTACCTG 
CTCCAGAAAC TCTGTGTCGT GAACAATGAC TTTGTAAAAG AAGGAGAGAT ACTTGCCCTG
ATCGAATCAT CAAAATCTGT CATTGACGTT ACGAGTCCGG CTGATGGCTA TGTCTATTTT
TTCTCTGACG AAGGAGACAT TGTGAGTGTC GGCGAGAGGT TGGCCTTGCT CGCATCAACC
AAGGAGGCAT TACAGGTTGA AATAGAGAAT TCCAACAAGA ACTCTCAGCG TCGCAAGGAG
AGTGAGACAA AGAGCGAGGA CGTTTCATTG TCTGGCGTGA GGTGCTCCAA GAAGGCATTG
CTTCTGATGA AACAGCACAA TATCGATGTA GGAGCATTTG ATGGTCTCGG GATGGTTACG
GCGCAGGACG TAGAGCACTA TCTCTCCAGC AGGGAAAAAG CCGTTAAAGC GACAGTAGCT
CCATCTTCCG TAAATAGGCA GAAAATCATC ATCCTTGGTG GCGGAGGACA TTCAAAAGTA
TGCATAGACA TACTGCGCCA GGCACAATCT TTCACAATCG CAGGAATTCT CGACTCTATC
CAGGACATTG GCGCAGAAGT GCTGGGAATC CCGGTCATTG GAAGAGACAC AATGCCGGAA
CTACTCAAGA CCAGAGAGAG TGGCATCTCC CTTGCGGTTA ACGGGATTGG ACTCATTCCG
GATCACCGGA ACAGATGCAA GCTTTTTGAG AGGCTATTGG AGGCCGGCTT TCATCTCCCT
AACCTCATAC ACCCCAAGGC ATCAATCGAA CCTTCGGCAA AACTCGGCGA AGGGAACCAG
ATCATGGCAG GAGCCATTAT CGGGAGCGAT GTCACAGTAG GAAACTACTG TCTCATAAAC
TCGGGAGTCG TCGTCTCGCA CGACTGTATC ATCGACGACC ACGTCCACCT GGCCCCCGGT
GCGCTGCTTG CAGGAGCAGT CAGAGTTGGA AGAAACTCTT TGATCGGCAT GGGCGTTACA
ATCTACGCAA AAGTAACAAT AGGAAGCAAC GTAGTTATAG CCAACGGCGC CAACGTGTTT
CACGATGTGC CGGACAACAC CGTCGTCAAG ATTTGA
 
Protein sequence
MMKTMDIIIP REFVSDDQYL LQKLCVVNND FVKEGEILAL IESSKSVIDV TSPADGYVYF 
FSDEGDIVSV GERLALLAST KEALQVEIEN SNKNSQRRKE SETKSEDVSL SGVRCSKKAL
LLMKQHNIDV GAFDGLGMVT AQDVEHYLSS REKAVKATVA PSSVNRQKII ILGGGGHSKV
CIDILRQAQS FTIAGILDSI QDIGAEVLGI PVIGRDTMPE LLKTRESGIS LAVNGIGLIP
DHRNRCKLFE RLLEAGFHLP NLIHPKASIE PSAKLGEGNQ IMAGAIIGSD VTVGNYCLIN
SGVVVSHDCI IDDHVHLAPG ALLAGAVRVG RNSLIGMGVT IYAKVTIGSN VVIANGANVF
HDVPDNTVVK I