Gene Cmaq_1351 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1351 
Symbol 
ID5710270 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1428256 
End bp1429344 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content44% 
IMG OID641275858 
Productglycosyl transferase family protein 
Protein accessionYP_001541167 
Protein GI159041915 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGATAC AATTACTCTA CATGGGTTTA CTCCTAGTAC TGATACACGT TGCGGTACCG 
TTAATTTACT ACATTATTAT CCTAACCTAC GCTAGGAGGC CTTGGTTAAT TAACTCAATT
AACGTTAATG ATGGTGAATT ACCCGCGGTA TCCATCATAA TACCTACATA TAATGAGGAG
AACATGATAC TGGGGAAGCT GGATAATATT CTTGAACAGA ATTACCCCCT GGATAAGATC
CAGTTAATAA TATCCGACTC AAGCAGTGAC AATACTCAGG TTAAGGTTGA GGAGTGGTTG
AGTAGGCATA GGGGAGTTAA CTTAAGTTAC ATTAAGGGCC CCAGGATGGG TAAGGGCCAT
GCATTAAATA AGGCGTTGGA GGCTGCGTCG GGTAGTATTA TAGTGACCAC TGATGCTGAT
TCACTTTGGG TTAAGGACTC ATTAATTAAC GCCGTTAAGT GGCTTAGTAA TGAGCAGGTG
GGTTTGGTTT CATGCGTAAA GGTACCTAGG GGTGGTGGAT CAACTGAGGA TGCCTATAGG
AGGCTTTACA ATACCTTGAG GATTGGGGAA AGTAAGATAC ACTCCACTGT TGTTTTCCAC
GGTGAATTAC TGGCTGTTAA GGGGGATTTA ATTAGGAGTA TTGGTGGTTT TCCAACGGAT
ATTGGTGCAG ATGACTCATA TACGGGAGTT AGGGTTGCCT CAATGGGTCT TAGAGCCGTG
ATTCCGGAGA ACGTGGTTTG CATGGAGTAT GTTCCAAGTA ATGGGTATAG TAGGTGGAGG
GTTAGGAGGG CTCAACACCT ATTGCAGAGC TTCATGAAGT CAATTAAGTT ACCTAAACCA
AGCAATTATA AACCAATCTA CTACACTGAA GCCTACATTC ACCTAATGAA CCCATGGCTA
CTCCCAATTG GCGCAATCCT GCTCCTAGCC TCAGGGAGCC TGTGGGCATA CGCCTTAATT
GCAGTGGGTT TAGTATTATT AGTGTGGTCA CCCTTCAGGG CTTGGGTAAC GCAGCAATTC
ATACTGATGT ACGCCATGGT CAGGAACCTG TGGACTAAGG AATTAATGTG GGAGAAGATT
AGTAAATAA
 
Protein sequence
MLIQLLYMGL LLVLIHVAVP LIYYIIILTY ARRPWLINSI NVNDGELPAV SIIIPTYNEE 
NMILGKLDNI LEQNYPLDKI QLIISDSSSD NTQVKVEEWL SRHRGVNLSY IKGPRMGKGH
ALNKALEAAS GSIIVTTDAD SLWVKDSLIN AVKWLSNEQV GLVSCVKVPR GGGSTEDAYR
RLYNTLRIGE SKIHSTVVFH GELLAVKGDL IRSIGGFPTD IGADDSYTGV RVASMGLRAV
IPENVVCMEY VPSNGYSRWR VRRAQHLLQS FMKSIKLPKP SNYKPIYYTE AYIHLMNPWL
LPIGAILLLA SGSLWAYALI AVGLVLLVWS PFRAWVTQQF ILMYAMVRNL WTKELMWEKI
SK