Gene Cmaq_1153 details

Gene Information

Locus tag: Cmaq_1153
Symbol:
ID: 5709897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1209736
End bp: 1210839
Gene Length: 1104 bp
Protein Length: 367 aa
Translation table: 11
GC content: 46%
IMG OID: 641275652
Product: major facilitator transporter
Protein accession: YP_001540970
Protein GI: 159041718
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
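
The listed lengths follow from the coordinates. The sketch below is a consistency check, not part of the record. It assumes the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1209736
END_BP = 1210839


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the last codon of the CDS is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1104
print(protein_length(length_bp))  # 367
```

Under these assumptions the results match the Gene Length (1104 bp) and Protein Length (367 aa) fields.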


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 43
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTTAGTA AGGCATCAGC CGTGGTTAAC ATAATGATCG CCAGATTCAT ATACAGCGTC 
TACTGGTACT ACCTGGCACC AGCCTTACCG TTAATTAAAC TGGAATTCAC AGTACCTAAT
TATGAACTCG GCTTAGTCCC ACTCTTCTTC ATAATAGGGG CTGGCTCATT CCAAATACCG
GCAAGCGTGA TCGCTAGGTT TATCGGTAAC GTTAAGACTG CTGTACTCGG CTTAACCCTA
CTATCAGCGG CTGGGGTAGC CACGGCCTTC AGTGTAAGCT TTAATGAAAT ACTAGCCCTA
AGGCTACTGG CCGGTATTGG CGCGGCATTA TTCTTCTCCA CGGCAGCCAC TGTTGTAACT
AATCTGTATC CTGGTAGAGA GGGGTTGATG CTTGGTATAT ATAACTCAGT GTTCAGTGCC
GGAGCCGGAG TAGGGTTGGT TTACGGGGTT GTTTACACTA TTGTTAATTG GAGGGTTGCA
GTACTGATTA TTAGCGTGGT GGGGTTGTTG GAATCCGTAA TACTCCTTAA GACCTGTTCA
CCACTCAATA GGCCCATTGA CACTGGTTTA TCCATAAACA AGGGCGCAGT ATTAGTGGGT
TTAGCCACAG CCGGGTATTG GGGGGCTAAT TACGCCGCCG GTAACCTACT ACCCACTTAC
GCCGTTAATC ATGGTGTTGG TTTAGTTAAC GCCTCATTAA TAACATCACT ACTCCTCTTC
TCAAGCCTAG TGGGTGGTTT ATCAGGTAAA TTAGCTGATT TAACCAGTAG GAGGGAGCTC
TTAATTATTG CACCCGCGGT GTTGGGTTCA TTATCATTCC TACTAATCAT AACACTTAAC
CCCTACGCCA TGATAGCCTC AACACTCATA GTGGGTTACA CCAATGAACT CATGATCACC
GCCTCCTATG CGCTTGTCGT TAATGATTCA AACCCAACCA TGAGCCTCGC AACAGTTAAC
ACGTTAAACA TGGTTGTAGG CATGTGGTTA AGCCCATTAT TCACAGCAGT CATGGGTAAT
TCAACGTTAC CATGGATCAC AATGATCATA GCCTCAGTGG CACCACTACC CCTCCTACTG
GTTAGGCGTA GGGTAGTAGG GTAA
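
GC content is the share of G and C bases among all bases in the sequence. A minimal sketch of that calculation follows. It assumes the sequence is pasted as shown above, with spaces and line breaks between 10-base blocks, and the helper name `gc_percent` is hypothetical. How the database rounds its 46% figure is not stated in the record.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G/C among the A, C, G, T characters in seq.

    Whitespace and any other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# Last line of the gene sequence only (24 bases, 12 of them G or C).
print(gc_percent("GTTAGGCGTA GGGTAGTAGG GTAA"))  # 50.0
```

Passing the full 1104-base gene sequence instead of the single line gives the value to compare with the GC content field.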
 
Protein sequence
MVSKASAVVN IMIARFIYSV YWYYLAPALP LIKLEFTVPN YELGLVPLFF IIGAGSFQIP 
ASVIARFIGN VKTAVLGLTL LSAAGVATAF SVSFNEILAL RLLAGIGAAL FFSTAATVVT
NLYPGREGLM LGIYNSVFSA GAGVGLVYGV VYTIVNWRVA VLIISVVGLL ESVILLKTCS
PLNRPIDTGL SINKGAVLVG LATAGYWGAN YAAGNLLPTY AVNHGVGLVN ASLITSLLLF
SSLVGGLSGK LADLTSRREL LIIAPAVLGS LSFLLIITLN PYAMIASTLI VGYTNELMIT
ASYALVVNDS NPTMSLATVN TLNMVVGMWL SPLFTAVMGN STLPWITMII ASVAPLPLLL
VRRRVVG
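
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration, not the database's own method. It uses the standard codon assignments, which translation table 11 shares for elongation codons; table 11 differs in the set of permitted start codons, which this sketch does not handle (the gene here starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G; third position varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str, to_stop: bool = True) -> str:
    """Translate a DNA string read in frame from its first base.

    Whitespace is removed first. A trailing partial codon is ignored.
    Codons with characters other than A, C, G, T raise KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*" and to_stop:
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence (both begin in frame).
print(translate("ATGGTTAGTA AGGCATCAGC CGTGGTTAAC"))  # MVSKASAVVN
print(translate("GTTAGGCGTA GGGTAGTAGG GTAA"))        # VRRRVVG
```

The two outputs match the first ten and last seven residues of the protein sequence above.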