Gene Sama_1049 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSama_1049 
Symbol 
ID4603301 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella amazonensis SB2B 
KingdomBacteria 
Replicon accessionNC_008700 
Strand
Start bp1269889 
End bp1271100 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content55% 
IMG OID639780388 
Productaspartate kinase 
Protein accessionYP_926926 
Protein GI119774186 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0527] Aspartokinases 
TIGRFAM ID[TIGR00656] aspartate kinase, monofunctional class
[TIGR00657] aspartate kinase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00683315 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.60962 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAAAAA TTTATGTAAA GAAGTTCGGA GGCACCTCTG TGGGTACCTT CGAACGCATT 
GAGGCGGTGG CAGATGCCAT CGCCAAAGCG CATTTTGAAG GTGAGAGGCA GGTGTTGGTG
CTCTCGGCTA TGGCCGGCGA AACCAACAGG CTTTATGCCA TGGCCGCCAA CATAGACCCT
CTGGCACCTG CCCGGGAATT GGACATGTTG GTGAGTGCAG GTGAGCAGGT CAGTATTGCC
CTGATGTCTA TCGCGCTGGC AAGACGGGGC GTTAATGCCA GGTCTTTGCT GGGTAGCCAG
GTCAAGGTGC GCACTAACAG CCAGTTTGGC AGAGCCAGTA TTGAGTCCGT TGACACAGGG
TTATTGATGC AGTTGCTGGA CGAAGGCGCT GTACCTGTTA TCGCCGGGTT TCAAGGCGTC
AACGAGCAGG GCGATGTGAC AACTCTTGGG AGGGGTGGCT CAGATACCAC TGCCGTTGCC
ATTGCCGCCG CACTTGAGGC GGCTGAGTGT CAAATCTTTA CTGATGTGAC CGGCGTTTTT
ACCACAGATC CCAATATAGA TCCCGATGCC CAGAAACTCG ATAGCATCAG TTTCGAAGCC
ATGTATGAAA TGGCAAGGCA GGGCGCTAAG GTATTGCATC CCGACAGCGT TGCCTGTGCA
CGCCGTCATG GCGTGGTGCT TAGGGTGTTG TCGAGTTTTG AGTCCGGCAG TGGCACCCTT
ATCCGCTTCG ATGAGCCAGA GCACTCCGGC TCGGGCATTG TGGGCATTGC CGTTACCCGT
GGACAAGGCC TGGTCTCTGT TGCCGGTTTG GTGGATCAGC CGCAGGCGGA AGTAGCCCTG
TTTCAGGCGC TGGCAAACGC CTCTGTGGAT ACTGACCTGG TGGTACAGCT GGCGGAAGAA
AAGGCACTGG CATTTACCCT GGCGCAAGGT GCACTCGATA AGGTGTTGAC CCTGATAGAC
AGGTTGGCGC TTGAGCAGCC TCTGGCGGAC GTTCGCCATG AGTCGCCATT GGCCAAGGTG
TCCCTCGTCA GCACCGGTAA AGCAGTCATG GCTGAAGTGG GGGCTCGTGT TACCGAGCTT
TTGGAAGCAC AAAACATTCG TGTTAAGTTA CTTTCGACAT CAGAAATCAA ACTGTCGGTG
GTAATCGATG AGGTGCATCT GCAGCATGCC GTCAGAAGTT TGCACAGAGC GTTTGACCTC
AATAAAGTAT GA
 
Protein sequence
MTKIYVKKFG GTSVGTFERI EAVADAIAKA HFEGERQVLV LSAMAGETNR LYAMAANIDP 
LAPARELDML VSAGEQVSIA LMSIALARRG VNARSLLGSQ VKVRTNSQFG RASIESVDTG
LLMQLLDEGA VPVIAGFQGV NEQGDVTTLG RGGSDTTAVA IAAALEAAEC QIFTDVTGVF
TTDPNIDPDA QKLDSISFEA MYEMARQGAK VLHPDSVACA RRHGVVLRVL SSFESGSGTL
IRFDEPEHSG SGIVGIAVTR GQGLVSVAGL VDQPQAEVAL FQALANASVD TDLVVQLAEE
KALAFTLAQG ALDKVLTLID RLALEQPLAD VRHESPLAKV SLVSTGKAVM AEVGARVTEL
LEAQNIRVKL LSTSEIKLSV VIDEVHLQHA VRSLHRAFDL NKV