Gene RoseRS_3079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagRoseRS_3079 
Symbol 
ID5210047 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRoseiflexus sp. RS-1 
KingdomBacteria 
Replicon accessionNC_009523 
Strand
Start bp3869273 
End bp3870676 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content61% 
IMG OID640596670 
Productaspartate kinase 
Protein accessionYP_001277392 
Protein GI148657187 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0244781 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00296029 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCCTTTGC TGGTGATGAA GTTTGGCGGA ACCTCGGTTG GCGATGCTGC TGCGATCCAG 
GCAGTAGCTG CAATTACCGA AGCGCAACGT GCTGCCTGGG GGCGTGTGGT GCTGGTGGTC
TCCGCAATGA GTGGTGTGAC CGATATGCTG ATCCGTGGCG CGACAACCGC TGCGTCCGGC
GATAAGCACA CATTTCGCGA TCTGGATCGG ATGCTGCGCG AAAAGCATGC CGCTGCGCTG
GCGGAGCTTG TGCCCGACGA TCAGGAGCGC ACCGCTATCG ACGATCAGAT TGCACGCCTG
ATCGACGAAT TCAGCATCCT GTGTCACAGC GTCGCCGTTC TCGGTGAACT GTCGCCGCGC
GCAATCGATG CGATTTCGAG TCTCGGCGAG CGTATGAGTG TGCGGGTGGT GGCTGCGGCG
CTACGGGCGC GTGGCATTCC CGCCGAAGCG CTCGATGCTT CCGAATTCGT GGTGACGACT
GCACACTTCG GCGATGCGCG CCCGTTGCAG GAAGTGACGC GCGAACGAAC CCGCGCCCGA
ATCCTGCCGC TGCTCGGCAG AGGCATCGTG CCGGTCATCA CCGGTTTTAT TGGTGCGACC
GAACAGGGGG TGACAACCAC ACTGGGGCGC GGCGGCAGCG ATTACAGCGG TGCGATCATC
GGCGCTGCGC TCGACGCCGA TGAGGTCGAT ATCTACACCG ACGTTGATGG GGTCATGACC
ACCGACCCGC GGCTGGCGCC CGATGCCCGT GTGATACCGG TTCTCTCATA TGCTGAGATG
GGAGAACTGG CATATTTCGG CGCGAAGGTG CTGCATCCGC GCACTATTCG CCCGATCGTC
GAACGCGGCA TTCCACTGCG TGTTCGCAAT ACGTTCAACC CGTACCACCC TGGCACGCTG
GTGGTTCAGG ATGTTGAGTC GAACGGTCAG ACGGTGAAAG CCGTCACCGC CATCCGTAAT
CTGAGTCTCG TCACTGTTGA AGGACGTGGC ATGATTGGCG TTCCTGGCGT TGCGGCACGC
ACCTTCGGCG CTGTCGCCAG TGTTGGCGCG AACGTGCTCA TGATTTCACA GGCATCGTCG
GAACAGAGTA TTTGCTTTGT GGTGCCGTCC AGTACCATTC CTCAGGTCAC CTATGCGCTC
GAGCATAACC TGGCGATGGA ACTGGCGCGC CGCGATATTG ACCGCATCTG GGCGCGCGAG
GATGTCGCAA TCGTGACTGC CGTCGGCGCA GGTATGCGCG ACACGCCGGG GGTTGCGGCG
CGCGTCTTTG GCGCACTTGC CGACAATCAT ATCAATGTGA TTGCCATTGC ACAGGGATCA
TCCGAATGTT CAATCAGCAC CGTCGTCGCT GCGGCGGACT GCGATGCCGC CGTGCGCGCT
GTGCATCGTC TGACGGGAGT ATGA
 
Protein sequence
MPLLVMKFGG TSVGDAAAIQ AVAAITEAQR AAWGRVVLVV SAMSGVTDML IRGATTAASG 
DKHTFRDLDR MLREKHAAAL AELVPDDQER TAIDDQIARL IDEFSILCHS VAVLGELSPR
AIDAISSLGE RMSVRVVAAA LRARGIPAEA LDASEFVVTT AHFGDARPLQ EVTRERTRAR
ILPLLGRGIV PVITGFIGAT EQGVTTTLGR GGSDYSGAII GAALDADEVD IYTDVDGVMT
TDPRLAPDAR VIPVLSYAEM GELAYFGAKV LHPRTIRPIV ERGIPLRVRN TFNPYHPGTL
VVQDVESNGQ TVKAVTAIRN LSLVTVEGRG MIGVPGVAAR TFGAVASVGA NVLMISQASS
EQSICFVVPS STIPQVTYAL EHNLAMELAR RDIDRIWARE DVAIVTAVGA GMRDTPGVAA
RVFGALADNH INVIAIAQGS SECSISTVVA AADCDAAVRA VHRLTGV