Gene Cmaq_1249 details


Gene Information

Locus tag: Cmaq_1249
Symbol:
ID: 5708951
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1319459
End bp: 1320559
Gene Length: 1101 bp
Protein Length: 366 aa
Translation table: 11
GC content: 41%
IMG OID: 641275754
Product: major facilitator transporter
Protein accession: YP_001541066
Protein GI: 159041814
COG category:
COG ID:
TIGRFAM ID:
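
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming it ends in a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(1319459, 1320559)
print(length, protein_length_aa(length))  # prints "1101 366"
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.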


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value: 0.000314091
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGCATAGGA ACGTTTACTT AATCTTACTC ATTAAGGGAC TGAGAACCTT CGTCTTCGGT 
ATTGTTAGTG TACTGACACC CATTTACTTA GCCATGTTGG GTTTCCCACC CATTTATGTG
GGTGCCTCTT TATTCCTTAT GGTTCTCGGC AATGTTCTCT CAAACATATT ATTAACCTGG
TTTGGTGACG TAATTGGTAG GAGAAGATTA TTAATCATCC TTAGTTTATT CATGTTTATT
TCAGGCATAT TATTGTTCTC ATCCTCATTA TACCCAGTAA TGGCATTAGC ACTACTTATA
GGTAACATAA GCACCACAGG AACTGAGGCT GGTCCCTTTC AATCAATTGA AACAGGCGTG
TTACCTAGGT TTACTGGTGA TAGGCTAGGT AGGATCCTAG GTGTTTACAA TCTCATTGGT
TACTCCGCTT CATCAATTGG CGCCCTTGCG TCATCATTAC CAGCCTACCT TGGGAATAAC
ATACATGTAA TTAGGTCAAT GTACCTAATT TATGCCCTTG CTGGTTTAAT AATGATTATT
GTTTATAACA CATTAAGTGG TATTGAGACC ACTAGGAGGG ATTTAGGGTT GAGGGGGTTA
AGTAGGTCTG CGGTCGCTGA TATTAGGAAT CTATCAATAT TATTCTCAAT AGATGCATTC
GGCGGTGGGT TGGTGACGCA GTCATTATTA TCATACTGGT TCTATATTCG TTATGGCGTA
TCCTTGAGGG AATTAGGTGT TGTATTCATG ATTGTTAACG TGGTTACAGC ATTATCGTTA
ATTATTGCAC CATTAATAGC TGAGAGGATT GGTAATTTAA GAACAATGGT GTATACGCAT
ATAGTATCAA ATGTCTTCCT AATATTAGTG CCGTTGGCTG GAACATTCCT GGGAAGCTTC
ATATTCCTCC TATTGAGGCA GAGTGTCTCT CAAATGGATG TACCGACTCG GCAAGCGTTT
ATGGTGCAGA TATTTAAGGA TGAGGAAAGA GTCGCCGCTA ACGCCATAAC CAACACTGCA
AGGAGCATAA GCACCTTACC TGGATCATTA ATAGTTGGTG ATAAAAGAGG TGGCAAACTT
CGCCTTTTCA AGGCGGGGTA G
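
The gene sequence is laid out in space-separated blocks of 10 bases, so it needs to be cleaned before any calculation such as the GC content reported in the record. The following is a minimal sketch, assuming the sequence has been copied into a string. The helper names are hypothetical, and only the first row of the sequence is shown as input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, as formatted in the record.
first_row = "ATGCATAGGA ACGTTTACTT AATCTTACTC ATTAAGGGAC TGAGAACCTT CGTCTTCGGT"
print(len(clean_sequence(first_row)))  # prints 60
print(round(gc_fraction(first_row) * 100))  # GC percentage of this row only
```

Passing the full 1101 bp sequence to `gc_fraction` would give a value to compare against the GC content field.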
 
Protein sequence
MHRNVYLILL IKGLRTFVFG IVSVLTPIYL AMLGFPPIYV GASLFLMVLG NVLSNILLTW 
FGDVIGRRRL LIILSLFMFI SGILLFSSSL YPVMALALLI GNISTTGTEA GPFQSIETGV
LPRFTGDRLG RILGVYNLIG YSASSIGALA SSLPAYLGNN IHVIRSMYLI YALAGLIMII
VYNTLSGIET TRRDLGLRGL SRSAVADIRN LSILFSIDAF GGGLVTQSLL SYWFYIRYGV
SLRELGVVFM IVNVVTALSL IIAPLIAERI GNLRTMVYTH IVSNVFLILV PLAGTFLGSF
IFLLLRQSVS QMDVPTRQAF MVQIFKDEER VAANAITNTA RSISTLPGSL IVGDKRGGKL
RLFKAG
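
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below builds the codon table from the usual compact TCAG ordering. NCBI translation table 11 assigns the same amino acids to codons as the standard code and differs in its set of permitted start codons, which this sketch does not model. The `translate` function is illustrative and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # 'X' for unrecognized codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 18 bases of the gene sequence above.
print(translate("ATGCATAGGA ACGTTTAC"))  # prints "MHRNVY"
```

The output matches the first six residues of the protein sequence, and the final codons of the gene (GCG GGG TAG) translate to the terminal "AG" followed by a stop.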