Gene Cmaq_1204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1204 
Symbol 
ID5709800 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1268692 
End bp1270251 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content44% 
IMG OID641275708 
Producttype II secretion system protein E 
Protein accessionYP_001541021 
Protein GI159041769 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0630] Type IV secretory pathway, VirB11 components, and related ATPases involved in archaeal flagella biosynthesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.431618 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value0.000463314 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGTCTAGCA TTAGCTTAGG ATTTGAGAGA ATGATTGATG TTAAGGGTCA GTATAGGTTG 
ATTGAAAGGT ACCCTGTTAA TGAGCCTTAC GCCTACGTTA ACATAATGGA GAACACTGAG
ACTGGTTCAA TAATGTACTA CGTTGATGAG GTCGCCTTAA CCACAAGTGA GAAGAGGGTT
TACGGTAATT TACTAAGGAT AATAATGAGT GAATTACCCC CACCTGAACA GATAGGTGAC
GTTAATTCAG TTAGGCAGTT TTTAGCTAAT AAGGTTAGGG AATTGATACG TAAGTACAGG
AGGTATCTTA GACTATCCCC CAATGCTCAA TCAACCCTAG CCTACTACCT TGAACGTGAT
CTACTGGGCT TTGGCCCAAT TGACCCCCTC ATGAGGGATG AGAATATTGA GGATATTTCA
GCTGATGGGG TTGGGAAGCC GATATACGTT TACCATAAGG ATTACGAAAG CATACCAACA
AACATAGTCC CCCTTAGTGA TGAGGCTATT GATGACTTAG TCGTTAAGTT GATACATATG
GCTGATAGGC ATGTATCAGT CGCAACACCA ATAGTTGATG CCCAGCTTCC CGACGGCTCA
AGGATTGCGG TAACCTATAG GCGTGAGGTC TCACCCAGTG GCTCAACTTT CACCATAAGG
AAGTTTAGGT CAAACCCATT CACCTTCACT GAATTGGTCA CTAATGGTAT GATTAGCCCC
GATATAGCTG GCTACTTCTG GACGATGATG GATTACCATA AGTCATTCAT GGTGATAGGA
GTAACCGGTG CCGGTAAGAC AACGTTCCTG AACGCAATGG CAACCTTCAT TAGGCCTAAC
ATGAAGATAA TAACTGTTGA GGAGGTACCC GAAGTTAAGT TACCTCACCA GAATTGGATT
AGGCTAGTTC CAAGGCTCTC CTTCGGTCCC CAGAAGACCT CTGAAATAAC CATGTTTGAC
TTGGTTAAGG CAACCTTAAG GATGAGGCCT GACTACCTAA TAGTGGGTGA GGTTAGGGGT
GAGGAGGCCT ATGTGCTTTT CCAAGCAGTC TCAACAGGCC ACGGTGGCAT ATCAACAATG
CACGCTGAGA ACTTTGATGC AGCCAAGAAT AGGTTGATGA GTCCGCCAAT GAATATACCC
GCAGCCTACA TACCCTCAAT GAATATCTTC GTAATGATTA GGAGGATAAG GATGATTAAG
GATGGTCGTG AAAGAGTCGT GAGGAGGGTT ATTGAGATAG GTGAACCCGT ACTGGATAAT
GGGGATGTTA AGTTCATAAC AGTGTTCAAG TGGAATCCAG TACTGGATAG GCATGAATCA
TACCTAGATA GAAGTGTTTT AGTCAGGGAT ATTGCGGAGG AGAGGGGTGT TAGGCCAAGT
GATGTTATTG AGGATATTAA GACCAGGGCA TCCATAGTTA GGTGGATGGT TGATAACGGT
ATTAAGGACT TCGATAAAGT GGCGAGAATG GTGGAGTTAT ACTATAATAG GCCTGAGCAA
GTGTTATCAA TGGTTAGGGA GAAGGCAACT GCACAGCCCA TTGCCCCAGC TCAGCGTTAA
 
Protein sequence
MSSISLGFER MIDVKGQYRL IERYPVNEPY AYVNIMENTE TGSIMYYVDE VALTTSEKRV 
YGNLLRIIMS ELPPPEQIGD VNSVRQFLAN KVRELIRKYR RYLRLSPNAQ STLAYYLERD
LLGFGPIDPL MRDENIEDIS ADGVGKPIYV YHKDYESIPT NIVPLSDEAI DDLVVKLIHM
ADRHVSVATP IVDAQLPDGS RIAVTYRREV SPSGSTFTIR KFRSNPFTFT ELVTNGMISP
DIAGYFWTMM DYHKSFMVIG VTGAGKTTFL NAMATFIRPN MKIITVEEVP EVKLPHQNWI
RLVPRLSFGP QKTSEITMFD LVKATLRMRP DYLIVGEVRG EEAYVLFQAV STGHGGISTM
HAENFDAAKN RLMSPPMNIP AAYIPSMNIF VMIRRIRMIK DGRERVVRRV IEIGEPVLDN
GDVKFITVFK WNPVLDRHES YLDRSVLVRD IAEERGVRPS DVIEDIKTRA SIVRWMVDNG
IKDFDKVARM VELYYNRPEQ VLSMVREKAT AQPIAPAQR