Gene Cmaq_0731 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_0731 
Symbol 
ID5708628 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp764621 
End bp765643 
Gene Length1023 bp 
Protein Length340 aa 
Translation table11 
GC content43% 
IMG OID641275232 
Producthomoserine dehydrogenase 
Protein accessionYP_001540557 
Protein GI159041305 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0460] Homoserine dehydrogenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.00124152 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGATTAAAA TAGCCTTAAT AGGGTTTGGT AATGTTGGGA AATCCTTCGC AAGGGTATTA 
ATCAATAGGG CCACTGAATT AACTAAGCTG GGGCTAGAAC CCTGCGTAAT AGGTGTACTT
GCATCAAGGG GTGGTTTAAT TAATGATAAC TGCATAGGTG GTGGAACACT CATGGACTTA
GTTAATAAAG GCCTATATAA TGCGCCTGGC TTTAAGGCAG TTAATTTAAG GGACTTAATT
AAACTTAAAC CAGACATCGC AGTAGTATCA ATACCACCAA GTTACCGCAC TGGTGAACCG
AATCTAACAA TATATAGGTT ATTGATAAGT GAAGGCGTAT CAGTTATTAC CGCTGATAAA
ACAGGATTAG CCTTAGCCTA CTGGGAATTA ATCAATGAGT CAAGGGCTAG GGGAGTCTTC
CTAGGCTTCA CGGCTACGGT TATGGCTGGT ACGCCGGTGA TTCAACTGAT TAAGGGATTA
AGGGGTAGGG TTGTTGAAAG CATTGAAGGT GTTTTAAACG CTACGTCAAA TTACGTCCTA
ACCCTTGTGG AGAATGGATT AACCATGAGT GAAGCCGTTA AGAAGGCTAT TGAGGAGAAG
ATCGCTGAAC CAGACCCAGC AATAGACCTA GGTGGCTTAG ACGCTGCCGC TAAGGCAACC
ATACTTGCTA ATGTGCTGGG CCTTAATGTG AGCTTAAGGG ATGTTAATGT TCAATCATTA
ATGAGCCTTA AGGATGATTA CATTAAGGAA TGCGCAAGGA GGGGTGTTAG GGTTAAGCAA
GTTGCCTCAA TAAGCTTAAC CAGCAGGGTA TTAAGCGTTA AGCCAATGGA GGTTCCAACA
GGTAGTGTGC TTGGCTCCAT TACAGGTAAC TACAATGCCT TAGTAATTAG GCTTAATGAT
GGGAAGGAAA TAACGGTAAT AGGCCCAACA GGCCCGGCTG AGGCAACAGC CGAGGTAATG
TTCAGTGATT TACTTGAATA CGCCGACTTA TTACTAACCA TGGGAAAGCG TATTAAGGGT
TAA
 
Protein sequence
MIKIALIGFG NVGKSFARVL INRATELTKL GLEPCVIGVL ASRGGLINDN CIGGGTLMDL 
VNKGLYNAPG FKAVNLRDLI KLKPDIAVVS IPPSYRTGEP NLTIYRLLIS EGVSVITADK
TGLALAYWEL INESRARGVF LGFTATVMAG TPVIQLIKGL RGRVVESIEG VLNATSNYVL
TLVENGLTMS EAVKKAIEEK IAEPDPAIDL GGLDAAAKAT ILANVLGLNV SLRDVNVQSL
MSLKDDYIKE CARRGVRVKQ VASISLTSRV LSVKPMEVPT GSVLGSITGN YNALVIRLND
GKEITVIGPT GPAEATAEVM FSDLLEYADL LLTMGKRIKG