Gene Cmaq_1077 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1077 
Symbol 
ID5709565 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1126871 
End bp1128433 
Gene Length1563 bp 
Protein Length520 aa 
Translation table11 
GC content48% 
IMG OID641275577 
ProductNa+ symporter 
Protein accessionYP_001540896 
Protein GI159041644 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0591] Na+/proline symporter 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00128504 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.725494 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTGGGA TTGCTGGTTG GGTATCATTC CTAGTACTCT TCGTAATCTT CGTGCTGCTG 
GGCTTCTATG GGGCTAGGTG GCGTAGGGGT GATTTAAGTA GGCTTGATGA GTGGGCCTTA
GCAGGTAGGA GGTTAGGCAC GTTCCTGGTT TGGTTCCTAG TTGGCGCGGA CTTATACACG
GCATACACAT TCGTCGCCGT TCCCGCTGGC GTATTTGCGA AGGGTGCATT ATACTTCTTC
GCGGTACCCT ACGTGGCCAC TGTATTCGCA TTAGCCACGG CAGTTATGCC CCCTCTCTGG
GAGTGGTCTA GGAGGAGGGG TTACATTACG GCCGCTGACT TCGTTGAGGA TAGGTTTAAC
AGTAGGTTAT TGGCTGCATT AGTGGCGTTA ACTGGTATAG TGGCTGAGTT ACCTTACATT
GCGCTTCAAA TAGTGGGTAT GAGTGCTGTG TTAGCCGTTA TGCTGCTTAA TGTTGTCCCT
GGGGCTAGCC TTGAGTTTGT GAGTGACTTA GCCTTAACCA TAGCGTTCAT AATACTGGCG
GCCTTCACAT ACACCAGTGG TTTAAGGGGT GCTACCTTAA CGGCCGTGTT TAAGGATGCG
TTAATATTCC TAAGCATAAT AGCAATACTT ATTGCCGCAC CCATAGCCAT ACCAGGGGCA
TTCCACACAG CCTTCAAGGT AGCCTCAAGC CTAAACAGCG GTAAAGGCTT ACCCACAGGC
ACCAGTAGCC TTAATTCAGT ATTCTCCTCA GCCTACTTAT CACTTTGGGT TGGTAGTTCA
CTGGCGCTTT ACCTATACCC CCACTCCGTT AATGGAGCCT TAAGCGCCGA GAGTAGGCGT
AAACTAGCTG TGAGCACGGC CCTACTGCCA ATATACGGTA TTGGGCTTGC CTTACTGGCT
CTTTACGGAA TACTGGTTTA CGCTGATTCA CAGGCCCTTA ACATTATAAG GAGTTTCCCA
TCAACTGGGT ACGCCGCTGT AATTAAAGGT AACTTACCCA TACCAGCCCT AGCAGCCACA
ATACTGCCTG ATTGGTTAGC CGGGGTGGCG TTACTGGGCA TTTTCATTGG GGGTATGGTG
CCTGCGGCAA TTATGGCTAT GGCTCAGGCG AATTTACTGA CCAGGAATGT GGCTAGGTAT
GTGGTTAAAT TAACCCCCCA GGGCGAGGCC AGGTTGGCTA AGTGGGCTTC AGTTTTCTTT
AAGTTCCTTG CACTTGTCTT CGCCTTAATA CCGGCATTAG CCTCAGTGTC AATAAACCTT
CAACTACTTG GGGGAATAAT AATAACCCAA ACACTACCCC CAATATTCCT TGGCTTAGTC
ACTAATAGGT TTAATAAGTA TGCACTCATG GCTGGTTGGG CTTCGGGAAT ATTAACAGGT
GTATACATGT TTATCACTAG GTATATTGCA ACAAAGGGTG CCTCACCAAC ACTATACCCG
ATTGCAGGCC ACCTATACTA CATTGCCGTA TTAGCATTGG TAATAAACAT ACTAGTTACC
TTAGTCGGCA CCGCATTAGC TGTGTTAATT AAGAGAAGTG CCGTTAGTAA GGTTAAGGCA
TAG
 
Protein sequence
MIGIAGWVSF LVLFVIFVLL GFYGARWRRG DLSRLDEWAL AGRRLGTFLV WFLVGADLYT 
AYTFVAVPAG VFAKGALYFF AVPYVATVFA LATAVMPPLW EWSRRRGYIT AADFVEDRFN
SRLLAALVAL TGIVAELPYI ALQIVGMSAV LAVMLLNVVP GASLEFVSDL ALTIAFIILA
AFTYTSGLRG ATLTAVFKDA LIFLSIIAIL IAAPIAIPGA FHTAFKVASS LNSGKGLPTG
TSSLNSVFSS AYLSLWVGSS LALYLYPHSV NGALSAESRR KLAVSTALLP IYGIGLALLA
LYGILVYADS QALNIIRSFP STGYAAVIKG NLPIPALAAT ILPDWLAGVA LLGIFIGGMV
PAAIMAMAQA NLLTRNVARY VVKLTPQGEA RLAKWASVFF KFLALVFALI PALASVSINL
QLLGGIIITQ TLPPIFLGLV TNRFNKYALM AGWASGILTG VYMFITRYIA TKGASPTLYP
IAGHLYYIAV LALVINILVT LVGTALAVLI KRSAVSKVKA