Gene Cmaq_1087 details


Gene Information

Locus tag: Cmaq_1087
Symbol:
ID: 5710394
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 1140431
End bp: 1141777
Gene Length: 1347 bp
Protein Length: 448 aa
Translation table: 11
GC content: 40%
IMG OID: 641275586
Product: hypothetical protein
Protein accession: YP_001540905
Protein GI: 159041653
COG category: [R] General function prediction only
COG ID: [COG2425] Uncharacterized protein containing a von Willebrand factor type A (vWA) domain
TIGRFAM ID:
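
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only illustrative: the function names are hypothetical, and it assumes the start and end positions are 1-based and inclusive and that the gene length counts the stop codon.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the stop codon is included."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1140431, 1141777)
print(length)                  # 1347
print(protein_length(length))  # 448
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of this record.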


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.6129
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 51
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGTCAGA GCAGTGAAGA GGATTACGGT ATACTACTTA ACGTTGATTA TGATGAGGAG 
TTAACCCGGT ATAGGCTTAG GGAGGTTTAC CTAATGGCTA AGCGCCACAT GATACCTAAC
CTTGATAAGT TAAGGAACTA CATACTCGCC GACTCATTCT ACATACACTA CAGGCAACCA
ATACTTAGGA GTGGTAATGA TAGTAATAAT TATAGGGCAT TATGGAGAAT GATAGTTAGC
AATTACATTA ATAGTGAGCA GTATGCTGAA GTCTCCAGGT TAACTAAACT AAATGGTAGG
TTATCCAGGA TAATGGCGGT TAAGTTACTT AGAATATACG TTAACATGCT TAATAGGATT
GAAAGGAACG AGAGACTTGC CAATGCATTG AGGGATGCGT CAAATAAATC GTCGCAATCA
TCAAGGGAGA GTAGAAATAT GCTTGATAGG GAGATTAGAT CATTAATCTC ATTTTACATG
GGGAACATGA AGAAGGTTAA TGACACTATT AATAAGGCTC GCTCCGTTCT AGGACCGGCT
ATAGGCCATG AGGTTGCTGA ATTAATACTG GATACGGACA TAGACCCATA TAGGGCTAGG
TTAGTTAACA TGCTTAATTC ACTGCTTAAG CTAGTCACGG AGTCAAGCCG CATGTATGAT
GAGGGGATAC TTAATGAGAT GCTTGATAAG GGGGTTATGA CTGGTATTAA GAGGATGGAT
AAGGTTAATG AGGTTAAGGA CCTAACCCCA ACCAATAGGG CATTGGCTAA GTTCGCTAAA
CCAATATACG CCTACAAGTT AGCCACAGGG AGCTTAACGG TTAAGGAGAG GAAGATGATG
AGGAAGCCTA AAATATACCT GGTTATAGAT AAGAGCGGCA GCATGTTCTA CACAGTTATG
AATAACATAT TTGAGTACAG TAGTGTGAGT AAGATAACCT GGGCAGCCGC ATTAGCCATA
GTAATGGTTA TGAAGGGTCA TGAAGTCGTG GCAAGGTTCT TCGACCAGCA GATATATCAA
TTAATGACTA ATAAGAAGGA CATAATAAAA ACACTCCTAT CCCTGGTCCC ATTAGGTGGA
ACAAACATAA CCTCAGCCAT TAGGGTAGCC TACGATGATG CTCACAGGAA CCCAGCCTTA
AGGAATTATA AACTAGTGCT AATAACTGAT GGTGAGGATG ATGAACTTGA CTTAGCGTTA
ATTAAGCAGG GTTTATCAGG TTTCAGTGAC TATAGGATAA TACTAATAGG TAATGAACAC
TCAGCCCTTG AAATGCTTGG ACCAAAGGTA ACTAAGATAC ATAACCTTAA CGCTAGATCA
TTAATAAGCG TACTAAGAAA AATATAA
 
Protein sequence
MGQSSEEDYG ILLNVDYDEE LTRYRLREVY LMAKRHMIPN LDKLRNYILA DSFYIHYRQP 
ILRSGNDSNN YRALWRMIVS NYINSEQYAE VSRLTKLNGR LSRIMAVKLL RIYVNMLNRI
ERNERLANAL RDASNKSSQS SRESRNMLDR EIRSLISFYM GNMKKVNDTI NKARSVLGPA
IGHEVAELIL DTDIDPYRAR LVNMLNSLLK LVTESSRMYD EGILNEMLDK GVMTGIKRMD
KVNEVKDLTP TNRALAKFAK PIYAYKLATG SLTVKERKMM RKPKIYLVID KSGSMFYTVM
NNIFEYSSVS KITWAAALAI VMVMKGHEVV ARFFDQQIYQ LMTNKKDIIK TLLSLVPLGG
TNITSAIRVA YDDAHRNPAL RNYKLVLITD GEDDELDLAL IKQGLSGFSD YRIILIGNEH
SALEMLGPKV TKIHNLNARS LISVLRKI
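
The relationship between the gene sequence and the protein sequence above can be illustrated with a minimal translation sketch. It is not the tool that produced this record. The names `CODON_TABLE` and `translate` are hypothetical, and the sketch relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ only in permitted start codons). Whitespace in the 10-base blocks is ignored.

```python
# Standard codon assignments, listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove spaces and line breaks
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGGTCAGA GCAGTGAAGA GGATTACGGT"))  # MGQSSEEDYG
```

The output matches the first ten residues of the protein sequence listed above.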