Gene Cmaq_1801 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1801 
Symbol 
ID5708824 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1877835 
End bp1878848 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content41% 
IMG OID641276310 
Productaminotransferase class I and II 
Protein accessionYP_001541612 
Protein GI159042360 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0436] Aspartate/tyrosine/aromatic aminotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000372137 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.409693 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGATAG GGCACTTTAA ATGGTTAAGT GAACATAAGG TTGACTTAAA CATTGCCAGC 
AGCGGTATGA TACCGGTTAG TAGGAGTGAC GTGGAAAAGT TGGGTGAACC CACTGATATC
ATGGAGGCTC TGAGCAACAT CTATAATGTA TCCACTAAGG CTATTGCGTT AACCCATGGT
ACACAGGAGG GCAATTTCGC AGTCCTATCG GCAATTAAAG ATGCTGTGGA TCAGGTTATA
ACAGTGGTGC CTGAGTATGA GCCGATTAGG GTTTTACCAA GTTTCCTTGG GCTTAGGCGA
ATTGAGGTTA AGGTGAATCA CGGGTTAAGT GAATTAATCA ACTATATTAA ACCCAGGTCA
GCTTTATTCT TCTCCAATCC AAATAATCCA CTGGGCATGC ACTTAAGTAG GGGTGAGATT
AGGGATCTTG CTGATGAGGC TAGGAGGAAG GGCTCATACT TAATCATTGA TTCAATATTC
CTTGAATTCG TCACAAGTGA TTTACGTGAC CTACCCCTGG AGAATACCGC ATACACCTTT
AGTACCTCTA AATTCTACAC TGTTGATTCC TTTAAGGTTG GTTGGATTAT TGGTGATGAG
GAGCTTATTC GAAGAGCCGT TAACGTAATT AACCTAGTAA GCCCCCTGGT TATTGGACTT
GAGGCATCCT ACGTCTCCAT AATGCTCCAG AATAGGGATT GGTTTAGGAG AAGAAACCTA
AGCATAATTT CACCTAATAG GGAATCATTA ATGAGTATTA GTGAATCATT AGGTAACTTA
ATTAAGGTAA CTTACTTCAA TCACATGCCC ATAGCCTACG TAACCACTAA GTGCAATGTT
GACAGTCTTG AATTAGCCAA TGAATTATTA AGAAGAGGAG TCTTAACTGT TCCAGGATTC
TACTTCGGTA TTAATAATGG AATTAGGATT GGCTTAGGCT CAGTTAACCA CGACACATTC
ACTAAGGCGC TTAACGTAAT GGTTAACGTG ATTACGGGCT TATGCACCAG GTAA
 
Protein sequence
MEIGHFKWLS EHKVDLNIAS SGMIPVSRSD VEKLGEPTDI MEALSNIYNV STKAIALTHG 
TQEGNFAVLS AIKDAVDQVI TVVPEYEPIR VLPSFLGLRR IEVKVNHGLS ELINYIKPRS
ALFFSNPNNP LGMHLSRGEI RDLADEARRK GSYLIIDSIF LEFVTSDLRD LPLENTAYTF
STSKFYTVDS FKVGWIIGDE ELIRRAVNVI NLVSPLVIGL EASYVSIMLQ NRDWFRRRNL
SIISPNRESL MSISESLGNL IKVTYFNHMP IAYVTTKCNV DSLELANELL RRGVLTVPGF
YFGINNGIRI GLGSVNHDTF TKALNVMVNV ITGLCTR