Gene Cmaq_0464 details


Gene Information

Locus tag: Cmaq_0464
Symbol:
ID: 5708785
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caldivirga maquilingensis IC-167
Kingdom: Archaea
Replicon accession: NC_009954
Strand:
Start bp: 503032
End bp: 504303
Gene Length: 1272 bp
Protein Length: 423 aa
Translation table: 11
GC content: 43%
IMG OID: 641274967
Product: solute binding protein-like protein
Protein accession: YP_001540299
Protein GI: 159041047
COG category: [R] General function prediction only
COG ID: [COG3889] Predicted solute binding protein
TIGRFAM ID:
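
The coordinate and length fields above are related by simple arithmetic, which the following minimal Python sketch makes explicit. It assumes the start and end coordinates are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are illustrative only and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons; the terminal stop codon,
    if present, does not contribute a residue.
    """
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(503032, 504303)
    print(length, "bp")                   # gene length from the coordinates
    print(protein_length(length), "aa")   # residues after dropping the stop codon
```

With the values listed for Cmaq_0464, the coordinates give 1272 bp, and 1272 / 3 = 424 codons, of which one is the stop codon, leaving 423 residues. Both figures agree with the Gene Length and Protein Length fields.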


Plasmid Coverage information

Num covering plasmid clones: 32
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.153689
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGCAA TAGGTAAGTG GGTTACGTTC TTATTAGCAA TAATGATCGC AGCCAGTCTC 
CTAGTTACAT ATGCCCAGCA AGTGGTGCCG CTGGTTCCAG CCTTTAACAC TACGTCAAAC
GTAGTGGTTA TTGGCAATAT GTCTAGTCAA TACGTCTCAA AGCTTCTTGG TTACGCCTTA
TTCTTTACTA ATCTTTGGGA CATGGGCTCA GGTACCTCGG GTTCAACTAT CTTAATGTAT
AATGCTGCTG AGGGAGTGGC ATATTATATA CGGAATTACA CCTTCATTAA GCCTACATAT
TCATACGATG TCACTCTTGG TTACCCTGCA ATATTCTATG GTCATTCACA GTGGGGTGGC
TTATATCCTA AGATGAGTCC ATTACTAATG CTTCCAGCTC AAGTCATATC GCTTCCAGGT
ATTTGGGCTT ACGTGAACTA CAGTACGGTT AAGTTCACTC CCGGTAACCT GCAGTTTGAC
TTATCACTAC AGGTTTGGCT GGGGGCAACG CCTAATGAGA CCAGTGGAGC CCAGCCAGGT
GATTTAGAGG TGATAATATA CTACTATACG CATAATATGG CCCCGGGTGG TTCAAATATG
GGTACTATAA CTGTGCCTAC GTTTGTTAAC GGTAGTATTG TTGATGAGAG TTGGCAGGTT
TGGGTTTACT ACGGTGGATC AAGCATGTGG ACTATAGCTT GGTTTGTACC AAGTATTAAT
CAGCCAAGCG GCTACGTTGG ATTAAACATA ACTGGTATGC TCTACGACTT ATTCAATATA
CTAGTTACCA ATTGGCCGCT GCAGTGGAAC TTCACTGGGT TACTGAACTA CTACTTATTC
CAGCTTGGTA ACGACTTCGG TACATCAAGT CAATTAAGTA ACGTCTTCGA GAACATAACA
GTATATAAGT ATTACCTTGA GTTAACTAAG CCATTAATAC CGATAATCAC GGTAACCAAC
ACAACAACAA CCACAGTCAC CACCACGGCA ACAACCTACG TAACATCAAC AGTAACCAGC
ACAACCACTA CCACTGTAAC CTCAACCACA ACCTCAGTTA GCACTACCAC TGTTACTTCA
CCTGTGACTA GTACGACAAC GTTAACAACC ACATCACTCT TAACCACCAC TAGTACCGCA
ACATTGGTAA GCACTTTAAC CAGTACCTTA ACAGTTACTA AGGAGGTTAT TGGGACAAGC
GTAATAGTGG GCATAGTAAT CATAGTAATC GTAATAGCAG TCATTGCGGC GCTGCTGGTT
AGGAGGAGGT AG
 
Protein sequence
MPAIGKWVTF LLAIMIAASL LVTYAQQVVP LVPAFNTTSN VVVIGNMSSQ YVSKLLGYAL 
FFTNLWDMGS GTSGSTILMY NAAEGVAYYI RNYTFIKPTY SYDVTLGYPA IFYGHSQWGG
LYPKMSPLLM LPAQVISLPG IWAYVNYSTV KFTPGNLQFD LSLQVWLGAT PNETSGAQPG
DLEVIIYYYT HNMAPGGSNM GTITVPTFVN GSIVDESWQV WVYYGGSSMW TIAWFVPSIN
QPSGYVGLNI TGMLYDLFNI LVTNWPLQWN FTGLLNYYLF QLGNDFGTSS QLSNVFENIT
VYKYYLELTK PLIPIITVTN TTTTTVTTTA TTYVTSTVTS TTTTTVTSTT TSVSTTTVTS
PVTSTTTLTT TSLLTTTSTA TLVSTLTSTL TVTKEVIGTS VIVGIVIIVI VIAVIAALLV
RRR
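
The gene and protein sequences above are printed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to do that and to translate the joined DNA, as a quick check that the two listings agree. It is a minimal illustration, not the database's own pipeline: the helper names `join_blocks` and `translate` are hypothetical, and the codon table is the standard one, on the assumption (true for NCBI translation table 11) that it differs from the standard code only in its permitted start codons.

```python
# Standard codon-to-amino-acid assignments, listed in TCAG order.
# NCBI translation table 11 shares these assignments; it differs from the
# standard table only in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Raises ValueError if the cleaned sequence is not a whole number of codons.
    """
    dna = join_blocks(dna)
    if len(dna) % 3:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(dna), 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


if __name__ == "__main__":
    # First seven codons and the final four codons of the gene sequence above.
    print(translate("ATGCCGGCAA TAGGTAAGTG G"))
    print(translate("AGGAGGAGGT AG"))
```

Applied to the opening of the gene sequence, the first seven codons translate to MPAIGKW, and the closing codons AGG AGG AGG TAG translate to RRR followed by a stop, matching the start and end of the protein sequence as listed.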