Gene Cmaq_1248 details

Gene Information

Locus tag: Cmaq_1248
Symbol:
ID: 5709400
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1318073
End bp: 1319395
Gene Length: 1323 bp
Protein Length: 440 aa
Translation table: 11
GC content: 43%
IMG OID: 641275753
Product: hypothetical protein
Protein accession: YP_001541065
Protein GI: 159041813
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
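
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes 1-based, inclusive coordinates and that the CDS ends in a stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(1318073, 1319395)
print(length)                     # 1323
print(protein_length_aa(length))  # 440
```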


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.00466791
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTGGAGTG GGGGCCGAAA TACCCTCACT ATGCCAGACG TGGGTACCAC TAGAACAGTG 
GTTGTTCGCC TTCTACCAAA TGATGTACAG GAGGGGGAAC TAGGACGATT AGCTAACGCC
TCAACATCAC TTTTCAACGA AGTGAATTAC GAAAGAAGAC GGCGATTCTT CAACAAGCAG
AAGATGGATT TCAAAGGAAC GTATAAGAAA TACTACGAGA AGTACAAGGG AATACTAAAG
GTGAATGCAC AAGCAGTTAT TCAAAAGAAT AATGAAGCGT GGTCATCATT CTTCTCTCTC
CTGAAGAAGG GTGAGAAAGC CTCCCCACCA GGCTATTGGA AAAGAGGAGG GGGAAGAGTG
TTAATCCTTG TTGTGAGACA GGATAGGTAC TACGTGGATG TTGAGAACCA CAAGCTAGTG
TTGAGGGACT TTAAACTAGA GATTCCCTTC GCCGGGAGAG TGAGGTGGTT TGGTAAACAA
GGTAGGCTAG AGATTCATTA CGATGATACT CGGAACAGGT GGTATGCATA TATTCCAGTT
GAGGTTGGTG TTACAACAAC ACGGACTGGA AAAGAGAGTA AGTTCATAGT TAAAGGGGAA
AGGAAAGGGA TTCAGCTTTA TCAACCGAAA GGAAATAAGG TGGCGTCTGC TGACCTAGGC
ATAAACATTC TAGCTAGTGT TGTTGTGAAT GATGGTACTT GGATTCTCTA TAAGAGTAGA
GCTAAGGAGG ATTACTTCTA TTTTCAGAGG AGGATAGCTG AGGTACAATC AATAGTAGGC
AAGGCTAAGA ATGCTGGTGA GCTAGAGGCT TATGAGGAAG CAAGAAGAGA GGAAGGAAGA
TTATATGGAA AGTTGTACCG TCGCCTTCTC CATCTGTATA GGAGCTTCGC ATCTCATCTA
ATGAAGACGT TGTACGAGAT GGGTGTGTCA ACCCTCATTG TTGGGTATCC TTACCTCATT
GCACAAGATA AAGGTAACAA GTTCACAGTG AATATGTGGT CTTACTCAAA ACTATTTGAG
GCTATTCTGT TGAAAGCCCA AGAGTACGGT ATTAAGGTCA TGAAGGTTGT GGAGTATAAC
ACATCTAGAG TATGCGCCTT TCACGATGTT GAAGTTGTGA GGAAACCTAG GGGAGTAATT
TCATGTCCAC ATGGTCATAA ACTACACGCA GACTTAAATG GAGCATTAAA CATCATGAAA
CTAGGAGTAG GAATAGTCAT AAACGAAGTG AAAAACCCCC TCTCCTTCTT TATTGATCAT
AACCAAGTAG CCCCCACAAA GGGGGGTAAC ACCCAAGACC CCAACGAAAC CCCCACCCTT
TAA
 
Protein sequence
MWSGGRNTLT MPDVGTTRTV VVRLLPNDVQ EGELGRLANA STSLFNEVNY ERRRRFFNKQ 
KMDFKGTYKK YYEKYKGILK VNAQAVIQKN NEAWSSFFSL LKKGEKASPP GYWKRGGGRV
LILVVRQDRY YVDVENHKLV LRDFKLEIPF AGRVRWFGKQ GRLEIHYDDT RNRWYAYIPV
EVGVTTTRTG KESKFIVKGE RKGIQLYQPK GNKVASADLG INILASVVVN DGTWILYKSR
AKEDYFYFQR RIAEVQSIVG KAKNAGELEA YEEARREEGR LYGKLYRRLL HLYRSFASHL
MKTLYEMGVS TLIVGYPYLI AQDKGNKFTV NMWSYSKLFE AILLKAQEYG IKVMKVVEYN
TSRVCAFHDV EVVRKPRGVI SCPHGHKLHA DLNGALNIMK LGVGIVINEV KNPLSFFIDH
NQVAPTKGGN TQDPNETPTL
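
The gene and protein sequences above are printed in space-separated blocks of ten. A minimal sketch for working with them follows: it strips the block spacing, computes the GC fraction, and translates the reading frame. The helper names are hypothetical. The codon table is the NCBI standard assignment string; translation table 11 uses the same codon assignments and differs only in which start codons it permits, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# NCBI ordering: each base position cycles through T, C, A, G, last position fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, dropping one terminal stop codon if present."""
    seq = clean(seq)
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence listed above.
first_blocks = "ATGTGGAGTG GGGGCCGAAA TACCCTCACT"
print(translate(first_blocks))  # MWSGGRNTLT
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the GC content and protein fields of the record.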