Gene Sare_4007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSare_4007 
Symbol 
ID5707429 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalinispora arenicola CNS-205 
KingdomBacteria 
Replicon accessionNC_009953 
Strand
Start bp4559592 
End bp4560929 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content69% 
IMG OID641273432 
ProductUDP-N-acetylglucosamine 1-carboxyvinyltransferase 
Protein accessionYP_001538788 
Protein GI159039535 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.291 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000246371 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGGACGACA TCATCCGGGT ACGCGGTGGA ACCAGGATGA CCGGCACCGT GCACGTGGTG 
GGTGCCAAGA ACTCGGCCCT GAAACTCATG GCCGTGGCGC TGCTGGCACC GGGCCGCAGC
GTCATCACCA ACGTCCCACG GATCACCGAC ATAGCGATCA TGGGAGAGGT CCTCCGCCGA
CTCGGCTGCG GGATCCGTTT CGACGCGGAC GACCCGGTGG ATCCCATGGT GGCGCACGGT
GGTGTTCCCC GTTCCCGTTC CGTGACCATC GACGTACCGG ACGTCCCTGG AGCCGAGGCG
GACTACGAGT TGGTCCGCCG GCTGCGTGCG TCGATTTGCG TACTCGGTCC GCTGCTGGCC
CGCCGCGGGT CCGTCCGGGT GGCGCATCCG GGAGGCGACG CGATCGGCTC ACGCGGTTTG
GACATGCACG TCTCCGGGCT TGCCCGGATG GGTGCCGAGA TCTCCGGCGA GCGCGGCTTC
GTGGTCGCCT CGGCCCCGCG CGGGCTGCGG GGGGCCGAGA TCGTGCTGGA CTTTCCGAGC
GTCGGTGCTA CCGAGAACCT GGTGATGGCG GCGGTGCTCG CCAAGGGCAC CACGGTTATT
GACAACGCTG CCCGGGAACC CGAGATCGTT GACATCTGCA CGATGCTCAA CCAGATGGGC
GCGCTCATCG ACGGTGCCGG CACGTCGACC CTGACCGTCG TCGGCGTGCC GGGGCTGCAG
CCGGTCCGGC ACGCCACGGT GGGGGACCGA ATCGTCGCGG GTACGTGGGC ATTCGGTGCG
GCGATGACCC GTGGTGATGT GACCGTGACC GGAGCCTCCC CGGCCTTTCT CGATGTGGCC
CTGGACAAGT TGGTGTCGGC CGGTGCGCTG GTGGAGACCC GGCGGGGCGC CTTCCGAATC
CGGATGGCTG ACCGCCCACG CGCCGTGGAT GTGGTCACGC TGCCGTACCC CGGGTTCGCC
ACCGATCTGC TACCGATGGC GATCGGGCTC GCGGCGGTCA GCGATGGGGC CTCGCTGATC
ACGGAGAACA TTTTTGACGG GCGGTTCATG TTTGCCAACG AGATGATGCG ACTCGGCGCG
GACATCCAGA CCGATGGACA CCATGCCGTG GTCCGGGGGC GGGAGCGACT GTCCGGGGCG
CCGGTGGCGG CTACCGACAT CCGCGCCGGC GCGGGCCTTC TCATCGCCGG GCTTTGTACC
GACGGTGTTA CCGAGGTCTC CCACGCGCAC CATGTGGACC GGGGCTATCC GGATTTCGTG
GCAGACCTGC GGGCGCTCGG TGTCGAGGTG GCGCGGGGTG GCGCGGCGGG GGGGCCGGGT
GTTCCCCTCG GACTGTGA
 
Protein sequence
MDDIIRVRGG TRMTGTVHVV GAKNSALKLM AVALLAPGRS VITNVPRITD IAIMGEVLRR 
LGCGIRFDAD DPVDPMVAHG GVPRSRSVTI DVPDVPGAEA DYELVRRLRA SICVLGPLLA
RRGSVRVAHP GGDAIGSRGL DMHVSGLARM GAEISGERGF VVASAPRGLR GAEIVLDFPS
VGATENLVMA AVLAKGTTVI DNAAREPEIV DICTMLNQMG ALIDGAGTST LTVVGVPGLQ
PVRHATVGDR IVAGTWAFGA AMTRGDVTVT GASPAFLDVA LDKLVSAGAL VETRRGAFRI
RMADRPRAVD VVTLPYPGFA TDLLPMAIGL AAVSDGASLI TENIFDGRFM FANEMMRLGA
DIQTDGHHAV VRGRERLSGA PVAATDIRAG AGLLIAGLCT DGVTEVSHAH HVDRGYPDFV
ADLRALGVEV ARGGAAGGPG VPLGL