Gene Cmaq_1813 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1813 
Symbol 
ID5709189 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1890588 
End bp1892030 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content45% 
IMG OID641276319 
Productmajor facilitator transporter 
Protein accessionYP_001541621 
Protein GI159042369 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATATAA ACACAAGTCG ATTAATCGCT AATGTGGCTA GTCAACCCGG TTACACTAGA 
GAAGACATAG AGAGGGGTAT AAAAAGAATA TACCAGGTCG TTCTTGCCCA GAGCCACGTT
ACTCCATACT TCATAATTGG CATAGCCATA GCCTCCCTAT TTCTCGATGC GTATGATTTC
AGCGCATTTT CACTAGCAAC GGCAGCATTC AAGAACACAT GGCCATGGAT GTCCTCCGCC
CTCTTTGGAT TTGCCATAGC TTCCATACAA ATAGGAGCTA CGATAGGTGC TCTGACGGGT
GGGTGGTTGA ATGATAGGAT AGGCAGGAGG AACATGCTTA TACTTAACAT GATCCTGTTC
GTAGCTATGG CTATTGGTGC TGGTTTGGCA CCGGATCCCT ACACGTTCTC AATATTCAGG
ATATTGCTGG GTTATGCGTT AGGTGCAGAC ATAGTTACGG GGTTTAGCTA CATCTTCGAG
TTCCTTGAGT TCAATAAGAG ACTCGTGTTC TCTGGCGGTT TTGATGCATA TTGGTTTGGT
TCTGTGGTGT TTGCCATAGT ATTCATAGTT TTTCCACTGT ATTTTGCACT ACATTCATTA
ACGCACCCAA TAATATGGAG GGCCATCATG GTTATTGGCG GTATTGCTGC CTTCATAATT
CTTCTGTTTA GATCAAGGAT ACCTGAATCG GTGCTTTGGA TAGCATATAG GGGTAGATTA
GCTACGGCGA AACGAATAAT TAAGCAGGTA TATGGAATAG ATTTACAGGA TGTACCGGAT
GTTGACTTGG ATATACACAA GGTTCGTGGT TTCAGGAGCT TGTTTAGGAT ATTCAGGAGG
AGTAAGTGGA AGGAACTCAC CAGTACCTTT ATAGGCACCT TCGAGGGTGG AATCGAGTTT
TACTCCTTCG GTTTCTATAC TCCATATATC TTATTGGTGC TTTCAAAAAT AGGCTCACTG
GCTACCCTAG TCTCAACTAC CATAATAAAT GTTGCGGGAT TCGCGGCGGG CATTGCTACG
GCATATCTTG TTCCGAGACT TGGTACGAAG AATCTATACG TCATAGGTAC ACTGGGTACT
GGTATCTCGA TGCTTGCGGC ATCCTTCGTA TTGCCGCCCA AGATAGTGCC ACTGATAGTA
TTTTTCGCTA CAACATTCTT GGTGTTCCAC GTAATGGGAC CCAATGGTGT ACAGTCATAC
GTAATGATAA ACACGGCATA CGGACCTAGT GAGAGAGGTA CAGCAGGTGG CTGGAACTAC
TTCTTCAGTA AACTGGCGGC AGTTGTAAGC TCCTTCTGGG CACCCATTCT GTTCAGCTCG
ATCGGCGTAG TGAATACATT ACACTTCCTG GCAACATTCG CATTTATCAC TGCAGTGATA
GGTGCGGTCC TCGGATTCGA TGCGAGGAAG TATAGGACGG AGGAAGAGGC CATTCCAACA
TGA
 
Protein sequence
MYINTSRLIA NVASQPGYTR EDIERGIKRI YQVVLAQSHV TPYFIIGIAI ASLFLDAYDF 
SAFSLATAAF KNTWPWMSSA LFGFAIASIQ IGATIGALTG GWLNDRIGRR NMLILNMILF
VAMAIGAGLA PDPYTFSIFR ILLGYALGAD IVTGFSYIFE FLEFNKRLVF SGGFDAYWFG
SVVFAIVFIV FPLYFALHSL THPIIWRAIM VIGGIAAFII LLFRSRIPES VLWIAYRGRL
ATAKRIIKQV YGIDLQDVPD VDLDIHKVRG FRSLFRIFRR SKWKELTSTF IGTFEGGIEF
YSFGFYTPYI LLVLSKIGSL ATLVSTTIIN VAGFAAGIAT AYLVPRLGTK NLYVIGTLGT
GISMLAASFV LPPKIVPLIV FFATTFLVFH VMGPNGVQSY VMINTAYGPS ERGTAGGWNY
FFSKLAAVVS SFWAPILFSS IGVVNTLHFL ATFAFITAVI GAVLGFDARK YRTEEEAIPT