Gene Cmaq_1389 details

Gene Information

Locus tag: Cmaq_1389
Symbol:
ID: 5709423
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1465402
End bp: 1466994
Gene Length: 1593 bp
Protein Length: 530 aa
Translation table: 11
GC content: 48%
IMG OID: 641275900
Product: permease for cytosine/purines uracil thiamine allantoin
Protein accession: YP_001541205
Protein GI: 159041953
COG category: [F] Nucleotide transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG1953] Cytosine/uracil/thiamine/allantoin permeases
TIGRFAM ID: [TIGR00800] NCS1 nucleoside transporter family
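
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that contributes no residue. The variable names are hypothetical and not part of the record.

```python
# Values copied from the record.
start_bp = 1465402
end_bp = 1466994

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A complete coding sequence is a whole number of codons.
codons = gene_length // 3

# Assumption: the last codon is a stop codon, so it adds no amino acid.
protein_length = codons - 1

print(gene_length, codons, protein_length)
```

Under these assumptions the result agrees with the listed gene length (1593 bp) and protein length (530 aa).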


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.461228
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGGGGGAAA CAATAAACTC CCAGTCAACG GTGGTTTACA ATAGAGAGAG GGGTCAATTA 
GAGTTAAAGG TCTCATACCC TGAGGAGAAG TACCTATGGA ACAGTGACCT TCACCCAACG
CCAATACGTA AGAGGACCTG GGGCTGGTAC ACTTATGCAG CAATATGGTT CAGCATGGCC
TTCATAGTGC CAAGTTGGTC ACTGGCAAGC CTCGGCTTAT CCTTTGGCCT AGGGGCGGTG
GAGTCAATAC TAGTCGTCTT CCTAGGCAAC CTAATAGTCC TAGTACCAAT GATAATTCAA
TCCCACGGTG GCGCAAGGTA CGGTATACCT GAACCAGTTT TAACTAGGAC TAGGTGGGGT
GTCTACGGGG CTGTTTTCCC CAGTTGGATA AGGGCTGTGA TCGGTGCCGG ATGGTGGGGT
ATTGAGTCAT ACATTATGAC TGAGGCCGCA GTAGGCATTT ACGCGGTCTT AAGCGGTAAA
CTACCGGTTA TTGAGTCCCT TGTGGCTAAG GGTGTTGCGT CACCATTCAC AATAAGTATA
GCCTTCCCTC AAGTCTTCTG GGTAACCTTC ATTGCAATAA TAATTCTTCA ACTAATCCTA
CTATACCACT CCCCAGTGCC TAATGCTCAA CCAGCCTTAA AGTGGTTTGC AAGGCTCTCG
GCACCGTTAA TACTGGCCGG CTTCCTTGCA CTATGGCTAC ACTTCATGTC AGCATCAGGT
TGGAATTACG GTAACATATT CTCAATACAC AGTAGCCTAA GGGGCTCAGC CTACTGGCTG
GCTTGGTTAG CCTTCCTAAA CGCAAACATA GCCTACTGGG CAACCATGGC CCTATCAATG
CCTGATTACA CCAGGTTCGC TAAGAGCCAG GTGGCTCAAA TGATTGGGCA GGTCCCAATG
CCATTCATGA TGCTTACAGT GGCTGTACTG GGCACCATGA CCACTGGGGC TGTTATGAGG
CTTACTGGTC AACCAATATG GGATCCAATA CTCCTATCAA CACTCTACAT GGGGCCCATT
GCCGGCGTGG TGGTTAATCT ACTATTCCTC CTAGCCACCT TCGCTGTTAA CGTATTCGCC
AACACCGTTG GACCAGCCTA TGACTTCGCC AATACCTTAC CCAGGTACAT AACCTGGTTC
AGGGGTGTTT TAATAGTGGT TGCTGTTGCA GTGCTCCTAG GTGCATGGAC ATACTATGGC
TCAGCCTACG GCTACCTATA CAATTGGCTA CTAACCTACG GTGGATTATT AGGCTCAGTG
GAGGGTATTA TAATATTTGA TTACGCATTA ATAAGGAGGT TTAAGTTTGA GCTTCAGGAC
GTCTTCCTAA GCCACGGTAG GTTCAGGTAC TGGAGGGGGA TTAACCCAGC GGCCTTCATA
ACCTTCGCCG TGGTCACCTT CATAATATAC GCTCCAATAC CGTACCACAG TATCCTATTC
AATAATGCAT GGGTACTGGC CTTCATATTA TCTGGGTTAA TATACACGCC ACTCATGGTT
TACTGGATAA TACCCAAGTA TCAGCCTCAC TTAAAGGGAT CAATATGGAG GGGAGGTTAC
GTATCCAGTG AGGTTAAGGA ATTATTCAGT TAA
 
Protein sequence
MGETINSQST VVYNRERGQL ELKVSYPEEK YLWNSDLHPT PIRKRTWGWY TYAAIWFSMA 
FIVPSWSLAS LGLSFGLGAV ESILVVFLGN LIVLVPMIIQ SHGGARYGIP EPVLTRTRWG
VYGAVFPSWI RAVIGAGWWG IESYIMTEAA VGIYAVLSGK LPVIESLVAK GVASPFTISI
AFPQVFWVTF IAIIILQLIL LYHSPVPNAQ PALKWFARLS APLILAGFLA LWLHFMSASG
WNYGNIFSIH SSLRGSAYWL AWLAFLNANI AYWATMALSM PDYTRFAKSQ VAQMIGQVPM
PFMMLTVAVL GTMTTGAVMR LTGQPIWDPI LLSTLYMGPI AGVVVNLLFL LATFAVNVFA
NTVGPAYDFA NTLPRYITWF RGVLIVVAVA VLLGAWTYYG SAYGYLYNWL LTYGGLLGSV
EGIIIFDYAL IRRFKFELQD VFLSHGRFRY WRGINPAAFI TFAVVTFIIY APIPYHSILF
NNAWVLAFIL SGLIYTPLMV YWIIPKYQPH LKGSIWRGGY VSSEVKELFS
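
The gene sequence begins with GTG while the protein sequence begins with M. A minimal sketch of how the two can be reconciled follows. It assumes the standard codon-to-amino-acid assignments (which translation table 11 shares) and assumes that the first codon of a complete coding sequence is read as methionine when it serves as the initiator. The names CODON_TABLE and translate_cds are hypothetical helpers written for this illustration.

```python
from itertools import product

# Standard genetic code in NCBI order (bases T, C, A, G).
# Translation table 11 uses the same codon-to-amino-acid assignments.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(dna: str) -> str:
    """Translate the start of a coding sequence (hypothetical helper)."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    residues = [CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3)]
    if residues:
        # Assumption: the initiator codon (here GTG) is read as Met.
        residues[0] = "M"
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
first_30 = "GTGGGGGAAA CAATAAACTC CCAGTCAACG"
print(translate_cds(first_30))  # MGETINSQST
```

The output matches the first ten residues of the protein sequence listed above. Without the initiator rule, the leading GTG would be read as valine.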