Gene Cmaq_1199 details


Gene Information

Locus tag: Cmaq_1199
Symbol:
ID: 5708983
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1263568
End bp: 1264917
Gene Length: 1350 bp
Protein Length: 449 aa
Translation table: 11
GC content: 44%
IMG OID: 641275703
Product: major facilitator transporter
Protein accession: YP_001541016
Protein GI: 159041764
COG category:
COG ID:
TIGRFAM ID:
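
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1263568
end_bp = 1264917
gene_length_bp = 1350
protein_length_aa = 449

# Assumption: coordinates are inclusive, so both endpoints count as bases.
span = end_bp - start_bp + 1

# Assumption: one codon per amino acid plus one terminal stop codon.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # coordinates match gene length
print(gene_length_bp % 3 == 0)                       # length is a whole number of codons
print(expected_protein_length == protein_length_aa)  # codons minus stop match protein length
```

Under these assumptions all three checks agree with the listed values: 1264917 - 1263568 + 1 = 1350 bp, and 1350 / 3 - 1 = 449 aa.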


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.329766
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value: 0.000717058
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAATAGGA GCAGTGAATT AAGGAGGGTG CATTACTTAA TATTCATGAG CTTCGCCCTA 
GGCTTCCTAA TATGGGGTTT TGTATCAACC AGTGGAATAA TGACTATAGA CTACTTCAAG
GATTACATAC CTAAGTGGCT CCTACCAGTC TCAGTGGTAC TTGGCTACAT ATTCGTAATG
CTAGGCGACA CTGTGATGGG TTTCCTAACG GATAGGGTGG GTAGGAAGAG AATATTCATA
TACACCATGA GCCTTTACAC CATTGGATTA CTGGGGATGG CTGCATCACT TTATGTTAAG
CAGGTGATCA ATCCTCTGAT AGCCTTCATC ATCTTAATGG TATCCTACGC CCTGGCTGAA
TTCGGCGTTG GTGGTGAAGA GCCACCAGCA TTAGCCGCTA TCTCGGAGCT TATGCCAAGT
GGCAGTAGGG GTATGATGCT TGTGCTCACG CCCAATTTCG ATAATATTGG TGCAGCCCTA
GCTGCAGCAG TGCTTTATGC CGCATTAGTT TACACCGGTT CCGCTAGTGT ATCATCAATA
TACGCCATGA TTGGTTCAGC CCTAGTGGTG GTTTTCCTGA CTATACTGGT TAGGTTACGT
ATACCTGAAT CAGTGAGGTG GCTTGAATCT AGGGGTAGGG TTAATGAGGC TGTTGAGATT
GCCAAGAGGG AGGGTCTTGA ATACGCGTTG AGTTCAGGTA ACTCGGTGGT GCAGTTTAAG
GCTCCACCAG CTTGGTATAG GGCATTATTC CTCTCAATAA TAGGCTTCAC TCAAATAACC
ACCTATGGCT TAATGGCGTA CACAATAATT TACCTACCAT CATTACCCTT CAGTAACAAT
TACAATCTAC AGGCACTGGT GATACTGTTG GCTAACCTAG GCGCCTCAAT AGCTGGCTTA
GTGGGTTTAA TAATGGATAA GGTGGGTAGG AGACCATTCA CACTCTTCGC TTACCTAGGG
GGCTTAGTAA CCATGGTACC CATATTCCTA ATATACGCAG CCTCCAATAC ACCATTAAAG
GCATCACTAC CAGTATTCTA CACTCTACTC TTCCTCAACA TGGTCTTCAG TGAATTCGAA
TGGGCTGTTA GAACTGTGCT TGAGCCTGAA TTATTCCCAA CTAGGGTTAG GGGAACCTGG
ATTGGTGTGA TTAGGTTAAT AGCATGGGGA ATATACGTAG TGTTAACCTA CTACCTATTA
AACATATTAA GCACATACCA GTACCTGCTC ACTAACCTAA TACTATACGC AATCGGGGCT
GCCGCCGCAG TGACATGGTT CATTTACGGT ATTGAAACCA AGGGAATACC AATAAGCACC
TTAGATAACA TAATGAGTAA ACAAAGTTGA
 
Protein sequence
MNRSSELRRV HYLIFMSFAL GFLIWGFVST SGIMTIDYFK DYIPKWLLPV SVVLGYIFVM 
LGDTVMGFLT DRVGRKRIFI YTMSLYTIGL LGMAASLYVK QVINPLIAFI ILMVSYALAE
FGVGGEEPPA LAAISELMPS GSRGMMLVLT PNFDNIGAAL AAAVLYAALV YTGSASVSSI
YAMIGSALVV VFLTILVRLR IPESVRWLES RGRVNEAVEI AKREGLEYAL SSGNSVVQFK
APPAWYRALF LSIIGFTQIT TYGLMAYTII YLPSLPFSNN YNLQALVILL ANLGASIAGL
VGLIMDKVGR RPFTLFAYLG GLVTMVPIFL IYAASNTPLK ASLPVFYTLL FLNMVFSEFE
WAVRTVLEPE LFPTRVRGTW IGVIRLIAWG IYVVLTYYLL NILSTYQYLL TNLILYAIGA
AAAVTWFIYG IETKGIPIST LDNIMSKQS
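
The sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup, with a GC-content helper of the kind that could be used to recompute the GC figure listed in the record. The function names `clean_sequence` and `gc_content` are hypothetical helpers, not part of any database tool, and the example input is only the last line of the gene sequence, not the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()


def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last line of the gene sequence above, used here as a short excerpt.
last_line = "TTAGATAACA TAATGAGTAA ACAAAGTTGA"
tail = clean_sequence(last_line)

print(len(tail))                            # number of bases in the excerpt
print(tail[-3:] in {"TAA", "TAG", "TGA"})   # does the excerpt end in a stop codon?
print(gc_content(tail))                     # GC percentage of the excerpt only
```

Passing the full gene sequence (all lines concatenated) to `clean_sequence` and then to `gc_content` would give a value to compare against the GC content field; the excerpt alone is too short to be representative.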