Gene Cmaq_1471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1471 
Symbol 
ID5709489 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1548027 
End bp1549178 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content43% 
IMG OID641275980 
Productglycosyl transferase group 1 
Protein accessionYP_001541285 
Protein GI159042033 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.630233 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.0889822 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGGTTT TGTTTATCAC ATACGGCCTT AGCATTGCTG GTGGTAATAG GGCTATTTTC 
GAGGTTGCGA ACAGATTAAG TGATAGGGGT TACTACGTTG AAGTACTTGC ACTTGGCGGT
GACCACTCAT GGTTTGACGT CAAGGTTTCT GTTAAGTACC TCGAATTACC CAAGAGAATG
CGCACCTTAC TTAGGGCATA TTCGTTGATT AAGTACTTAA GGGTTAAGGA GTATAAGTAC
ATCGATGTTA ACACCTTTGC CAAGAGGCTT GGTTTTAGGG TTGACCTTAT TAGGCCCTTG
GCTGAGGCAA TACCAAATAA TTTTGATGCA GCAGTAGCCA CCTATTATCC AACGGCATTA
TCATTATGGC TCTCAAGACA TGAAGGCTTA AGACTATATT TCGTACAAGA TTTCCCAGAA
TTAGCATCAG CAGATGGTCA TTACGGCTTA AGGCTTTGGG ATCTAACACT TAGAATACCA
TTTAACGCAT TCTTAGCAGT ATCGACATAC ATAAAGAATT TGATACTAGA GAGGCAATAT
AATGCAAAGA TCTTAGTGAC TGGTGCTGGT GTTGATATTA ACGTATTTAA GCCAAGTAAG
GATAAATTGG TTGATGTTAA GGGTAAATAT AAGGTAATGA CTATCATAGG ATCTAATCCG
TGGAAAGGCG CTGATGTTGC CATTAGGGTT TTGAGTGAGG TCTCTAGGAA ATTGCCAATA
CACGCAATCC TAGTTGGGGA TAGGGAATTC GTAGATTTGT TAATAAAAAG TGTTAAGCAT
AGCTTCACAT ACACAGTATT CAGCAACGTG CCCGACGACC TACTGGCCAG GCTGTACAGC
AGTGCCGATG CATTCCTATT TACATCATAT GTCGAGGGCT TTGGACTACC ACCGCTTGAG
GCCATGGCTT CAGGAACTCC CGTTGTGACT ACGGATTGCC TCGGTAATAG GGACTACGTG
ATTGACGGCG TGAATGCATT GGTTGCTAAG CCGGGCGATG TCGAGGGGCT AGCAAACTCA
CTAATCAAGA TACTCATGGA CGAAAAGCTC AGGGAGAGAT TAATCGAGAA TGGACTCAAA
ACCGCGAAGC AATGGAGCTG GGATAAAGTT GTGGATAAAT TTGAAGAAGC AATTAAGGGT
GAGTCACAAT GA
 
Protein sequence
MKVLFITYGL SIAGGNRAIF EVANRLSDRG YYVEVLALGG DHSWFDVKVS VKYLELPKRM 
RTLLRAYSLI KYLRVKEYKY IDVNTFAKRL GFRVDLIRPL AEAIPNNFDA AVATYYPTAL
SLWLSRHEGL RLYFVQDFPE LASADGHYGL RLWDLTLRIP FNAFLAVSTY IKNLILERQY
NAKILVTGAG VDINVFKPSK DKLVDVKGKY KVMTIIGSNP WKGADVAIRV LSEVSRKLPI
HAILVGDREF VDLLIKSVKH SFTYTVFSNV PDDLLARLYS SADAFLFTSY VEGFGLPPLE
AMASGTPVVT TDCLGNRDYV IDGVNALVAK PGDVEGLANS LIKILMDEKL RERLIENGLK
TAKQWSWDKV VDKFEEAIKG ESQ