Gene Cmaq_1830 details


Gene Information

Locus tag: Cmaq_1830
Symbol:
ID: 5710082
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1909330
End bp: 1910550
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 49%
IMG OID: 641276335
Product: hypothetical protein
Protein accession: YP_001541637
Protein GI: 159042385
COG category: [S] Function unknown
COG ID: [COG1602] Uncharacterized conserved protein
TIGRFAM ID:
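
The coordinate and length fields above agree with each other: the start and end positions are inclusive, so the span is 1910550 - 1909330 + 1 = 1221 bp, and 1221 / 3 = 407 codons, one of which is the stop codon, leaving 406 residues. A minimal Python sketch of that arithmetic follows; the function name `cds_lengths` is hypothetical and is not part of the source database.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a CDS that ends with a
    single stop codon, which is not counted in the protein length.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Values taken from the record above.
gene_bp, protein_aa = cds_lengths(1909330, 1910550)
print(gene_bp, protein_aa)
```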


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGGATTA ACCCTGAACT ATGCATTAAA TGCCGTGGGG AGTATAACCT ATGCGGGTTA 
GCGTACTGCC CAATACTAGT GAATAATTGG ACACTTAGGA GGATTAAGCC CCTTGAGGGT
AGGCAGGATG TTAACGGCTC CTCACCCCCA AGCATCATGG TTGGTAGATT AGGTTACCCT
AAGGTTAGAG TCTACCCAGC CACACCACCC ACTCATGGGG ACACAGGTTG GCTTGAGGAA
CCGAGGACAT GGTTAAGCAT GAGGCTTGAG GACTTCCTAT CAAGTAGACT CACGTTAATC
AGGGGGTCAG TAATCTTCAA GGTTAATGAC CCAAGGAACC CACCTAGGCA ACTCCACGAC
ATACAGGTAA TGGCTATTTC AAGCGGACCC GTGGACACTG AGTTAACGCT GGCTAAGCCA
ATTAAAGGTA ATGTAACCTT AAATGAACAG GAACCACCAA TAGGCCCCTC AGCCCCATTG
AGAAGCATTA AATTATCCAC AATACCTCAA CCAAGCAGGG CTGTGGAGAA GGCTTACTCC
GACGTGGATT TAAAGGCTAA TGATGCCGTT TGGATGCTTT ATAACTCAGG CATTGATGTG
CACGTTATCT CAAGGTTAAT GAGCGTAGGC GCCATAGGTA GGGGTAGGGC TAGGAGACTC
GTACCCACTA GGTGGTCTAT AACTGCGGTT GATGAGGAGG TTTCAAGTAG GTTAATCAAT
GAGGTTAAGA ATTACCCTGA GTTAAGCGAG TACAGGGTTT ACGTCAGGAG GAGTAACAAT
AACCTATTCA TAGGCATACT AGCCCCACAC ACCTGGCTAT ACGAGTGGGG TGAGGCATGG
TGGCCTGGGA GCACTTGGAA CACCTGGGGA AGTGAACCAG TGATTGAAAT CGATAGTGAA
GGCTACTGGG GTAGGGACAC CTACCCAAGC ATCGGTGGAT GCTACTACGC AGCCAGGTTA
GCTGCAGCAG AGGCGCTTCA TAGTATGCAT AGGCAGGCAG CCGTAATTCT CTGGAGGGAG
ATTTACCCAG GCTTCAATAT ACCTGTGGGT GTTTGGTTCG TTAGGGAGAA TGTTAGAGCC
ATGTTTAAGG GTAGCTACGT CAGCTTCAGT AGCCTTGATG AGGCGCTTAA ATTCGCCTCA
AGTCAACTTA AACTACCGCT GGCTCAATGG GCCTCAAGAT CCTACGTATT GAGGAGGTTA
AGGGAGGCTA GATTACTATG A
 
Protein sequence
MRINPELCIK CRGEYNLCGL AYCPILVNNW TLRRIKPLEG RQDVNGSSPP SIMVGRLGYP 
KVRVYPATPP THGDTGWLEE PRTWLSMRLE DFLSSRLTLI RGSVIFKVND PRNPPRQLHD
IQVMAISSGP VDTELTLAKP IKGNVTLNEQ EPPIGPSAPL RSIKLSTIPQ PSRAVEKAYS
DVDLKANDAV WMLYNSGIDV HVISRLMSVG AIGRGRARRL VPTRWSITAV DEEVSSRLIN
EVKNYPELSE YRVYVRRSNN NLFIGILAPH TWLYEWGEAW WPGSTWNTWG SEPVIEIDSE
GYWGRDTYPS IGGCYYAARL AAAEALHSMH RQAAVILWRE IYPGFNIPVG VWFVRENVRA
MFKGSYVSFS SLDEALKFAS SQLKLPLAQW ASRSYVLRRL REARLL
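
The gene sequence begins with GTG while the protein begins with M. This is expected under translation table 11, where GTG can act as an alternative start codon and is then read as methionine. The sketch below illustrates this with the standard library only. The names `translate_cds` and `START_CODONS` are hypothetical, and the start-codon set is an assumption limited to the three most common initiators (NCBI lists more for table 11).

```python
from itertools import product

# Codon assignments shared by NCBI translation tables 1 and 11,
# listed in the conventional T, C, A, G order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common initiator codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    A recognised start codon in the first position is read as M.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCGGATTA ACCCTGAACT ATGCATTAAA"))
```

Applied to the first 30 bases of the gene, the function reproduces the first ten residues of the listed protein sequence.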