Gene RoseRS_2031 details


Gene Information

Locus tag: RoseRS_2031
Symbol:
ID: 5208993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Roseiflexus sp. RS-1
Kingdom: Bacteria
Replicon accession: NC_009523
Strand:
Start bp: 2520772
End bp: 2522181
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 63%
IMG OID: 640595637
Product: aspartate kinase
Protein accession: YP_001276366
Protein GI: 148656161
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0527] Aspartokinases
TIGRFAM ID: [TIGR00657] aspartate kinase
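
The coordinate and length fields in this record can be cross-checked with a little arithmetic. The following is a minimal sketch in Python, not part of the original record. It assumes that the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon; the record states neither convention explicitly.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2520772
end_bp = 2522181
gene_length_bp = 1410
protein_length_aa = 469

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not counted in the protein length.
codons = gene_length_bp // 3
expected_protein_aa = codons - 1

print("span from coordinates:", span_bp)
print("codons in gene:", codons)
print("expected protein length:", expected_protein_aa)
```

Under these assumptions the coordinate span equals the listed gene length, and the codon count minus one equals the listed protein length.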


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.296392
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTGTGA TGAAATTCGG TGCAGTTGCG GTCAGTGACG CCAGCCGGGT TAATGATCTG 
GTGCGTATCG TCCGCCACGC TATCGACGAA GGCGAAGCGG TCGTGGTTGT ATGTACCGCG
ATTGCGGACC TGACCAACCT GTTGATCGGC GCAGGACGCG CCGCAGCGCG CGGTAACCTT
ACTGCCGCCG AGCAGGCGCG CCGCGAATTG TGGCAACGCC ACCGCACGCT CGCTGAACGC
CTGGTAACCG ATGACTGGGA ACGCGAGACT CTCTACCGGG CATGGGCTGA CCTGCTCAAA
TCGTTCGACC GGATTGTACG CGCGATTGCG ACGCTCGGTG AACATTCGCC ACGCAGCAGC
GACGCCGTGG CTGCTATCGG CGAACGCTTC ATCGGGTTAT TGCTGGCAGT GGCGCTGCGG
CGCGGCGGGG TTGCGGCGCA GTTGATCGAT GGCGCCGAGT TAATCGTGAC TGATGATCAC
TTTGGCAATG CACGCCCGCT ACCGGAGGAA ACCACTGCAC GGGCCCGCGC ACGCCTGCTG
CCGTTGACCC AATCGCGGAT CGTTCCAGTG GTGACCGGGT ACATCGGCGC GACTCGCCAG
AAGATAACCA CGACGCTTGG GCGTGGCGGC GGCGATTATT CGGCAACGCT GATCGCCGCT
GCGCTCGAAG CCGATGAAGT CGTGATCTGG ACAGATGTGC CCGGCATTCT TACTGCCGAT
CCGAAACTGG TGCCCGAAGC ACGCACACTG CCGGAACTGT CGTATATCGA AGCCACTGAG
ATCGCCACCC TTGGCGCGGA GGTGCTCCAC CCACGCTCCC TGACTCCGCT CGCCAATCGC
AACATTCCGC TGCACATCCG CAGCCTGGAA CAACCCCACA TTCCGGGCAC GCGAATCGTT
GCCGCACCGC ACATCTCTTC TGACACAGCA CGCACGATCA TCTCGGCGCC GTCCATCAGT
CTGATCGAGA TCAGCATGAG TCCTCTGGCG GCAGCTGAAC TTGGATGGGC GCCGGACCTG
GCGGCGCGTA TCCTGGCAGA ATTGACCGGA TGCGGCATCG AAGTGCTGAC CTTCGCGCAG
AGTTTCAGCG AACGAGGGTT GGTGCTGGCA GTGCGTGCCA CCGATGCCGA GTATGCCTAT
GAGCGTATCG AAGCCTGCCT GCAACCAGAG CGGGACAGCA AGGCGCTGCG TGCGATCAGT
TTGCGTGCGC CGGTGGCGCT GGTGGCGGTC ATCAGTGCGC CGGAGAGTAC ACGTCTGGCG
CCGCGCGCGC TGACAGCGCT GGCGCGGGTG CAGGGCACGG TGCTGGCGAT GGTTCACGGC
AACACCTCAC GGCACCTGTC ATTCATCGTG CCAGAAGAGG AATTGAGCGC CGTCGTGCGT
GCCCTGCACC GTGAATTGAT GGCGGGATAA
 
Protein sequence
MVVMKFGAVA VSDASRVNDL VRIVRHAIDE GEAVVVVCTA IADLTNLLIG AGRAAARGNL 
TAAEQARREL WQRHRTLAER LVTDDWERET LYRAWADLLK SFDRIVRAIA TLGEHSPRSS
DAVAAIGERF IGLLLAVALR RGGVAAQLID GAELIVTDDH FGNARPLPEE TTARARARLL
PLTQSRIVPV VTGYIGATRQ KITTTLGRGG GDYSATLIAA ALEADEVVIW TDVPGILTAD
PKLVPEARTL PELSYIEATE IATLGAEVLH PRSLTPLANR NIPLHIRSLE QPHIPGTRIV
AAPHISSDTA RTIISAPSIS LIEISMSPLA AAELGWAPDL AARILAELTG CGIEVLTFAQ
SFSERGLVLA VRATDAEYAY ERIEACLQPE RDSKALRAIS LRAPVALVAV ISAPESTRLA
PRALTALARV QGTVLAMVHG NTSRHLSFIV PEEELSAVVR ALHRELMAG
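
Both sequences are displayed in blocks of 10 characters, 60 per line, so the spacing has to be removed before any length or composition check. The sketch below is a hedged illustration rather than the database's own method. The helper names `clean_sequence` and `gc_percent` are hypothetical, and only the first display line of the gene sequence is included to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C characters in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGGTTGTGA TGAAATTCGG TGCAGTTGCG GTCAGTGACG CCAGCCGGGT TAATGATCTG"
first_line_clean = clean_sequence(first_line)

print(len(first_line_clean))                    # number of bases on the line
print(round(gc_percent(first_line_clean), 1))   # GC percentage of that line only
```

Passing the full gene sequence through `clean_sequence` should give a length that can be compared with the Gene Length field, and `gc_percent` on the same string can be compared with the listed GC content. The cleaned protein sequence can be compared with the Protein Length field in the same way.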