Gene Cmaq_1804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1804 
Symbol 
ID5709954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1880750 
End bp1882162 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content43% 
IMG OID641276313 
Productmajor facilitator transporter 
Protein accessionYP_001541615 
Protein GI159042363 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000239079 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCAGG GCAACTACAG CAGGGAGGAG ATCGAGAGAG GTATTGCTAG GATCTATGAG 
ATTGTGTTAA ATACTAAGAA TATAACAGCT AGATACATAG TAATTCTTGC CTTGGCCTCG
CTATGGGTTG ATGCCTATGA TTTTGCAGCA TTCACATTTG CAACAGCAGC ATTTAAGAAT
ACGTTTCCCT GGATGTCCAC GTGGCTCTTT GGCCTAGCAG TAGCTGCGGT TCAAATAGGA
GCTACGGTAG GTGCTGTAGT TGGTGGTTGG TTAACTGATA GAATTGGTAG AAGAAACATG
TTCATCTTGA ATATGATACT CTTTACGGTA ATGGCAGTTG GGGCCGGTCT TGCACCGGAT
CCATATACTT TCACAGTCTT TAGAATATTA CTTGGTTTTG CATTGGGTGC AGATACGGCA
ACAGGTTTTG CGTACATATT CGAATACCTA GAGAAACAGC AAAGATTAGT CTGGTCTAAC
TTATGGCAGT TGCAGTGGTA TCTAATGTAT GAGGTTGTTA TATTTATCTT CGTATTACCA
TTCTACTTAA TAACTTACTC ATTACTTCAT CCATGGTTCT GGAGGATTAT TATGTTTGGA
GGGGCCGTTT TTGCGTTCAT TATATTAATG CTACGAGCAA GAATACCAGA GTCCGTACTA
TGGGAGGCAT ATAGGGGGCG TCTAGCCACT GCTAAGAGGA TCCTTAAGCG TACACACGGT
ATTGATTTAC CTGACGTACC TGATATTGAC GTAGAATTAA GAAGACCTGC TAGGGGATTA
AGGTCCGCAT TTAAGATATT CAGGAGGAAC AAGTGGAGGG AACTTGTTTA TTGCTTTAAT
GGAAACTTTG AACAGGGTTT TGAATTTTAC ACCTTTGGTT TCTACATGCC GTATATATTA
CTGACTATGC ACCTAGCCGG TTCATTAGCC ACTATAGAGG CATCTGCAAT ATTCTATGGT
ATGGGTGTAA TTGCTGGAGT CCTTACGGCA TATCTTACGC CTAGAATAGG TACTAAGTCT
CAGTACGTGA TCGGTGCTGC GTTAGCTGGC ATAACACTAC TTGGTCTTGC ATTCACATTC
CTGTATCACT GGCCCCTCTG GCTATTCGTA TTATTTGCTT CTGCATTTTA CTTTGGGCAT
GTTATTGTGC CTGCAAGCCA GGGTATGACG TCCATAAATG CGGCCTTCGG TGCCAGTGAA
AGAGGCACTG CAGCAGGCTG GGGTTATTTC TGGGTTAAGT TAGCCGCTGT TGTAGGTTCT
TTCATAGCGC CCACATGGTT AGCCGTTCTC GGTGCCCCTA AGATGACGGA AATACTGGGT
ATCTATGCTT TAGCAACTGC AATCTTGGGA TTAGCAATAG GTTTTGATGC TAGGAAGTAT
AAACCACCTG AAACAGAGGA GGTTGCTGTT TAG
 
Protein sequence
MSQGNYSREE IERGIARIYE IVLNTKNITA RYIVILALAS LWVDAYDFAA FTFATAAFKN 
TFPWMSTWLF GLAVAAVQIG ATVGAVVGGW LTDRIGRRNM FILNMILFTV MAVGAGLAPD
PYTFTVFRIL LGFALGADTA TGFAYIFEYL EKQQRLVWSN LWQLQWYLMY EVVIFIFVLP
FYLITYSLLH PWFWRIIMFG GAVFAFIILM LRARIPESVL WEAYRGRLAT AKRILKRTHG
IDLPDVPDID VELRRPARGL RSAFKIFRRN KWRELVYCFN GNFEQGFEFY TFGFYMPYIL
LTMHLAGSLA TIEASAIFYG MGVIAGVLTA YLTPRIGTKS QYVIGAALAG ITLLGLAFTF
LYHWPLWLFV LFASAFYFGH VIVPASQGMT SINAAFGASE RGTAAGWGYF WVKLAAVVGS
FIAPTWLAVL GAPKMTEILG IYALATAILG LAIGFDARKY KPPETEEVAV