Gene Rcas_4330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRcas_4330 
Symbol 
ID5541843 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus castenholzii DSM 13941 
KingdomBacteria 
Replicon accessionNC_009767 
Strand
Start bp5583659 
End bp5584945 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content61% 
IMG OID640896436 
ProductUDP-N-acetylglucosamine 1-carboxyvinyltransferase 
Protein accessionYP_001434372 
Protein GI156744243 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0766] UDP-N-acetylglucosamine enolpyruvyl transferase 
TIGRFAM ID[TIGR01072] UDP-N-acetylglucosamine 1-carboxyvinyltransferase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.469868 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAACGTT TCATCATCGA AGGCGGTCAC CGTCTTAATG GCGCCATCAG ACCGTCCGGC 
AACAAGAATG CCGCACTCCC CTTGCTTGCC GCAACCCTGT TGACCAATCA CAATGTCGTG
CTGCACAACA TTCCACAGAT CGGCGATGTT GTTTCCATGA TCGGGTTATT GGAGCGGCTC
GGCGCACGGG TCGAACGTCG CGGATCGCAC AGCTGGGCAG TGTATGCCAA CGCTGTCGAT
GCCGCAGAAC CCGATCCTGC GCTGGCGCGC AAGATCCGCG CCTCGGTGCT GCTCGCCGGT
CCTTTGTTGG CGCGACGCGG GTACGTGACC ATTCCTCGTC CCGGTGGCGA CATGATCGGG
CGACGACGTC TCGACACCCA CCTTAACGCA CTGCGTGATC TCGGCGCAGC GGTCGAAGTC
ACGCCAACAG CATATATTCT GCGCGCTGAA CGGCTACGCG GCGCCGACAT CTTCCTCGAC
GAAATGAGCG TCACCGGCAC CGAGCAGGCC GTCATGGCCG CTGTGCTCGC CGAAGGCGAC
ACTATTATCA ATAACGCTGC ATCAGAACCA CATGTCCAGG ACCTCTGCCA TTTCCTCAAC
CGGCTGGGGG CACGTATCGA CGGGATCGGC ACCAATCGTC TGCACATTCG AGGCGTGTCG
TCGCTTGGCG GCGGCGAGTA CACCATTGGT CCCGACTTTA TGGAAGTGGC GTCATTTATC
GGGCTGGCGG CAGTGACCCG CAGCGCCTTA CGCATTGTCG GCGCGCGCCC ATCCGAGCAT
CGCATGACGC GCATCGCCTT CGGGCGATTG GGCGTCGCCT GGCGTGACGA AGGGGACGAT
ATCGTCGTGC CTGCTGAACA GGAGTTGTGC ATCCGTGATG ATGTCCATAA TGCGATCCCG
AAGATCGACT CATCCCCCTG GCCCGGCTTC AATCCCGATC TGATCAGTAT CGCAATTGTA
GTAGCCACAC AGGCGCGCGG CACGATTCTG ATCTGGGAAA AAATGTTCGA AAGCCGGCTC
TTTTTTGTGG ACCGGCTCAT CGGCATGGGG GCGCGGATTG TGCTGTGCGA TCCCCACCGC
GTCGTGGTCG TCGGTCCGAG CCAGTTGTAT GGCGAACCCG ACGGGTTGCC AAGTCCCGAT
ATTCGGGCGG GCATGGCACT GCTGCTGGCA GCGCTCTGTG CGCAGGGGCG CAGTGTCATT
TACAACATCG GGCAAATCGA CCGTGGCTAC GAGCGGATCG ATGAACGTCT GCGCACAATC
GGGGCACATA TCGAACGCGC GCGTTAG
 
Protein sequence
MERFIIEGGH RLNGAIRPSG NKNAALPLLA ATLLTNHNVV LHNIPQIGDV VSMIGLLERL 
GARVERRGSH SWAVYANAVD AAEPDPALAR KIRASVLLAG PLLARRGYVT IPRPGGDMIG
RRRLDTHLNA LRDLGAAVEV TPTAYILRAE RLRGADIFLD EMSVTGTEQA VMAAVLAEGD
TIINNAASEP HVQDLCHFLN RLGARIDGIG TNRLHIRGVS SLGGGEYTIG PDFMEVASFI
GLAAVTRSAL RIVGARPSEH RMTRIAFGRL GVAWRDEGDD IVVPAEQELC IRDDVHNAIP
KIDSSPWPGF NPDLISIAIV VATQARGTIL IWEKMFESRL FFVDRLIGMG ARIVLCDPHR
VVVVGPSQLY GEPDGLPSPD IRAGMALLLA ALCAQGRSVI YNIGQIDRGY ERIDERLRTI
GAHIERAR