Gene Cmaq_1463 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1463 
Symbol 
ID5709346 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1540444 
End bp1541652 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content45% 
IMG OID641275972 
Productpolysaccharide biosynthesis protein 
Protein accessionYP_001541277 
Protein GI159042025 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.0435937 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value0.00981325 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCAGTG AATCCTCGGT GTCTGGGGTT ATTACTGGAT ACTTAGCATC CGCCGTGATA 
TACGTCATAA TACTAACCAG GTTAATACCC CTGACACAGT ACGGTTATTA CAACTCACTC
CTAGCAATGA TGGGGATATT CTCACTCTTC TTCCCAACCC TGGGCATCGA CGTGGCTATT
GCCAGGGAGG CTGCCATGCT CCATGCCAGG GACATGCCCT TTGAGGGACA CATGGCCGCC
ATACTCCTCA TCTCAATAAT ACTGACCACC GCATACTCAC TAACGTTATT CCTCGCAATA
CCCCTGTACA TAATTAGTAA GATACCCAGT TACTACCTGG GTATTGTTTA CATATACATT
GCCTGGATAA TAACCCAGGC ATTTACCGGC GTTCTCTCAA CATACCTATG GATAATGAGC
AAGCTCAGGT CCCAGGGTGT TGGGAATATG CTCTACAGCC TTGTCTTTAG GCCCCTTGAG
GTTGCCCTAT TAGTGGTAAT GCACAGTGTC TACGCGATTA TAATATCCAT ACTAATTGGT
CAATTAACAG CGCTCCTCTA CTACATGTTA ATTATAAGGC GATTACCAAA CCCACTGAAG
GGCTTGGCTC TGATAAAGAA TGGGCTTAGA AGGTACCTCA ACACGGGCTT TCAAAACTGG
ATAATCAGTT ACATAGGCTC AATAGGGGGT TACGCATTAA CATACCTAGT GTACCTATCC
CTAGGCCCTG AGTACGTGGC TATATACAAC CTAGTAACAT ACATGCTCGG CGCAGTAACA
ACATTAACTG GTTCAGTGAG TAACGTATTC AGTAGTAAAC TTTCACACGT GATAGGCGCC
GGCGGTGATA CAAAGGCCTT AGTAAGGGAT TATGCAATCT CCATTATAGT GACCAGCGGC
GTACTATCGC AGTTAGCCAT GTTAACCATC CCACTGCTTC CTATCCTGAG TATTGTGCAT
GGTGATTACG TGAGATCCAT ACCCTATGCG ATGTTGTTAC TAGCCTCAGC GGTGATTTCG
GCACCCGTGA GTATATACAC AGTGTATTAC TGGGTCCTTG GTAAGGGTTG GCATTCAGTT
AAGATCTCAG CATTGGGGGT TACCGTGGGT CTTTTAATAT TCATAATCAC TGTTAAGTAC
CTGGGCTTCT ACTCAGTAAT CCTTTCATCA TACGCATCCT CAATCTCCCC ATTAATCGCA
TTCATATAA
 
Protein sequence
MSSESSVSGV ITGYLASAVI YVIILTRLIP LTQYGYYNSL LAMMGIFSLF FPTLGIDVAI 
AREAAMLHAR DMPFEGHMAA ILLISIILTT AYSLTLFLAI PLYIISKIPS YYLGIVYIYI
AWIITQAFTG VLSTYLWIMS KLRSQGVGNM LYSLVFRPLE VALLVVMHSV YAIIISILIG
QLTALLYYML IIRRLPNPLK GLALIKNGLR RYLNTGFQNW IISYIGSIGG YALTYLVYLS
LGPEYVAIYN LVTYMLGAVT TLTGSVSNVF SSKLSHVIGA GGDTKALVRD YAISIIVTSG
VLSQLAMLTI PLLPILSIVH GDYVRSIPYA MLLLASAVIS APVSIYTVYY WVLGKGWHSV
KISALGVTVG LLIFIITVKY LGFYSVILSS YASSISPLIA FI