Gene Cmaq_1492 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1492 
Symbol 
ID5709121 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1568100 
End bp1569395 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content42% 
IMG OID641276001 
Productamino acid permease-associated region 
Protein accessionYP_001541306 
Protein GI159042054 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0531] Amino acid transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.993916 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value0.460323 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGCCA GGGAGGTTTC GTTGAGAAGG GTGCTTAATT TCATGGAGCT GGTGGCATTA 
GGCTACTCCG ACGTATCATC AACATACTAC TTCTCACTTG GCGTAATAGC ACTATACTCC
GGTTCATCAT TACCCGTCAC CATGATGCTT GGCTCTATTC CACTATGGGT TGCTGGGTTA
ACGTACAGTG AATTAGCTAA AGTAAGCCCT GAGGTTGGTG GTGCATACTA TTACGTGAAC
CTTGGCCTAG GCCGCTTTGG AGGCTTTATT GCAGGTTGGT TACTTGGCTT TGATCAAATA
CTAATGATGG CCTACGGCGC CTTAGGGTTC GCGAATTATC TACTAACGGC ATTAATTGGG
TTAAACCACG GTGGATTCAT CATTACGCTT ACTTCACTGG CGATTATTTG GTTTCTGGCT
ATTTTAAACA TAATTGGCAT TAAGTTATCA GCTAGATTAA ACCTAGTACT AGTTATGATC
GATATCATTG GTATCTTAAT ACTAACCACC GCTGGGTTTT ATAGATTAAT GCATATGCAT
GAGTCAATTA ATGCAGTGAG TCTAAGCATT GCACCAGTGG GTTTAGCATA TGCCTTGAGG
GGTTACATTG GTATTGATGT AATAGCTCAA GCAGCAGGGG AGGCTATGGA GCCGAGTCGC
AATGTACCTA AATCAATAGT AACCATATGC ACCCTCTCAA CTGTTGTTGC AATACTAGTT
TCAACATTAG CCGTCTACAC GGGTGGCGTC AAGGTAATGA TGATGCATCC TGAAGACCCA
CTCTCAGCAC TTGCAGTGAA CCTAATCGGC TTCAATGCGT TGAGCATATA CATATCGGTA
TCCATAGCCC TAGTGATGCT TCTAAGCGTT AACTCCGGGA TAGTGGACTT CTCCAGGGGC
CTCTATAGGA TGAGTATTGA TAGACTCCTA CACAAATCAA TATCAAGTGT TCATTCAAGG
TTCAAGACTC CATTTGCATC AATAATAGTG GCTTCAATTA CATCATCATT ATTCGTAATA
CCTAACGATG TTGAATTAAT AGTAGGCTCA TACGGCATAG CCTCACTAGT GGCGTACACA
TTAGCCTTAC TATCACTAAT AAGACTTAGG GATAAGTCAC CACTAATGGT AATTGGATTA
ATGGCATTAA TTACGGCTAT ACTGTTAACG CTAATATTCA AACCATACTA TGCGATACCA
GTGTCACTGT GGTTCGCCAT AGGGCTTATA CTACTAGCCA TGACAAGTAA ACGGCTAAGA
CTAATTAAGT TAACCCCACA TAGGCCTCAT GAATAA
 
Protein sequence
MSAREVSLRR VLNFMELVAL GYSDVSSTYY FSLGVIALYS GSSLPVTMML GSIPLWVAGL 
TYSELAKVSP EVGGAYYYVN LGLGRFGGFI AGWLLGFDQI LMMAYGALGF ANYLLTALIG
LNHGGFIITL TSLAIIWFLA ILNIIGIKLS ARLNLVLVMI DIIGILILTT AGFYRLMHMH
ESINAVSLSI APVGLAYALR GYIGIDVIAQ AAGEAMEPSR NVPKSIVTIC TLSTVVAILV
STLAVYTGGV KVMMMHPEDP LSALAVNLIG FNALSIYISV SIALVMLLSV NSGIVDFSRG
LYRMSIDRLL HKSISSVHSR FKTPFASIIV ASITSSLFVI PNDVELIVGS YGIASLVAYT
LALLSLIRLR DKSPLMVIGL MALITAILLT LIFKPYYAIP VSLWFAIGLI LLAMTSKRLR
LIKLTPHRPH E