Gene Cmaq_1143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1143 
Symbol 
ID5710143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1198814 
End bp1200013 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content41% 
IMG OID641275642 
Productmajor facilitator transporter 
Protein accessionYP_001540960 
Protein GI159041708 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.41427 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCATTTA AGGCTATTGG GCGTGAGGGA GTATTACTTG CCGTGGCCAG TTCCATAACC 
GGCATTATGT TCGGTGCTAA CTCAGTCATA TTATCAATAT ACATGCTTAA CATAGGTATG
AAGCCGACTT TAATAGGTGT TGTTATTGGT GCTTCATCCC TCATGAGTGC CCTTGGATCA
TTAATCACCG GCTACTTATC CGACTTCATT AATAAGTTAA GCCTATTCAC GTTTCTCTCA
TTAGCAAGTG GTTCATTGAT ACTACTGTTG GTTACTGGTT TACCACCAGT GATAACTATG
GTTTACCCGC TCATTGCCTT GCTTAACCGC AACGTTATAT CCATTGCTAT TTCCGGTGAG
TATGCTAGAC GAAGGGGGAT ATCCAGTGAA TTCTTCAGCT TATCATCTTC ACTTAACGTA
GTATTCAGTG TTATTGGGTC ATCAATAACT ATGTTACCAA GCTACATGGG TAGAATGGGG
TATGACTTAG TCTTCATTAT TGAATCACTA TCAGTGTACT CATCAATACC AATAATGCTT
ATAGCCATTA GGAGAATAGG CATTAATGTA ACTGAGGTTA AGATCAGCAG GGTTAGTTTA
AGGGAGTTAA GGGAGTTGAA GTCATCGTGG TTACTTAAGA GGCTTATTCC CGAATCATTA
ATAGGACTTG GGGCGGGGGT AATAATTCCC CTCTTTAGCC TGTGGTTTTA CTTAAAGTTC
CACATAAATA TAAGTAACTT AAGCATAGTG TACGCTGCAT CAAATGCAAC GTTAGCATTA
GGTACATTAA CGGCACCTAT GATTTCAAGA ATACTGAGAA GTAGGGTTAC CTCAGTAATA
TTACTGGAGG GTTTAGCTAC AGGTATATTA GCTTTAATGC CAATCATACT GAACATCCCT
TCATTACTGG TACTCTTCAT AGTTAGGAAC ACCTTAATGA ATATGGCTAA TCCTCTACTA
ACATCATTAA TCAACGACCT AGTGCCGGGG GAGGAGAGGG GGAGGGTTTT CGGTATATGG
ATGCTCCTAT CATCAATACC GCGTGCACTG GGTCCGGGAA TAGGGGGTTA CTTAATGGGT
TCCGGTTACC TGGATCTTCC ACTATACATA ACATCACTAC TATACGCCAC TGCAGTGGCC
TTATTCTACG TTCTGCTTAA GGATGTTGAG AAGATGAGTA GGTTAACCAT AGGTAGGTGA
 
Protein sequence
MSFKAIGREG VLLAVASSIT GIMFGANSVI LSIYMLNIGM KPTLIGVVIG ASSLMSALGS 
LITGYLSDFI NKLSLFTFLS LASGSLILLL VTGLPPVITM VYPLIALLNR NVISIAISGE
YARRRGISSE FFSLSSSLNV VFSVIGSSIT MLPSYMGRMG YDLVFIIESL SVYSSIPIML
IAIRRIGINV TEVKISRVSL RELRELKSSW LLKRLIPESL IGLGAGVIIP LFSLWFYLKF
HINISNLSIV YAASNATLAL GTLTAPMISR ILRSRVTSVI LLEGLATGIL ALMPIILNIP
SLLVLFIVRN TLMNMANPLL TSLINDLVPG EERGRVFGIW MLLSSIPRAL GPGIGGYLMG
SGYLDLPLYI TSLLYATAVA LFYVLLKDVE KMSRLTIGR