Gene Cmaq_1848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1848 
Symbol 
ID5709594 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1924733 
End bp1927198 
Gene Length2466 bp 
Protein Length821 aa 
Translation table11 
GC content43% 
IMG OID641276355 
Productextracellular solute-binding protein 
Protein accessionYP_001541655 
Protein GI159042403 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0747] ABC-type dipeptide transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAACT ATATACTTAA GGAGAAAGGT TATGTAACAG TAATACTACT GGTAACAGTA 
GTAATCTTAA CTCTGAGCGC CTTAAAAGTA GCCTACGCTC AACAATCCTA CCCAACTTAT
AATATGTATA TGGCTGCTCC ATGGTACTAT GTCCTGCTTC CTCCTTGGCC TGCTCCTTGG
TGGAATCCCT TTGCCTCAGG TAATGTGATA TTCCAAACTG GTACATGGAT GCCTCTTGCC
GATTATAATG ATCCCACTGG TCAGTGGTGG CCTGTTTTGG CTGAGAATTG GACTGTGTTT
CCTCAGAATC AGACCTTGAT AATCTACCTT AGGCATAACT TGTATTGGTT TAATGGTTCA
GCGGTAATAC CCTTCACCGC ATGGGATGTT TACGCATGGT TCTACATTAG TGATAAGGCC
TTTGGTGAAT GGGAGCCTTG GTTAATGCCT CAGAATGCTG ATAAGGACTT CATAATCATA
AACAACTACA CAATATCCAT ACACTTCGAC TACTGGTCAA CCACAGGGTA TTACTGGTTA
CTAATGAGCT GGATACCTAA TACACCATGG CCAGTTTGGG AGAGCGTAGT AAACGAGTTG
AAGACAATGA ATGTAACAGA GGCGCAGAAA TTTGGGTCAG TTAACATAAC AAAGATGGCT
ATACCATACT GGGGTTTAAG CCCATACTAC GTAACCAAGG CTACACCAAA CTATATCACT
GTTCAGCTTG AGCCTGATTA CTTCCAGGGT AAGCCATTGT TGGCTGAGTG GGATAAGATA
CTACCATTCC ACACATGGCA ATACTATCCG CAGATAACAA TAAACATAGC CGTATCAGGT
GGATTAACTC AATTAGTATC ATATGCGTTA AGTGGCCAAC CATACTTCAT ACATACCGTT
GAGGCTATGC CCTATTCATT CCAGAGTAAG TTGAAGAGTG CAGGCTACTA CATACTTCAG
CTACCTGACT TATCCATTGA GGGAATAGCA TTACCAACAT ATTACCCGTT TAATATTCCT
CAGGTTAGGC AAGCATTCCT ATACATCATT AATAGAACTG CTGCAGCTGC ACCATGGTCA
TGGCCTAATG TACCAGTCTT CATTAATGTT CCAGCCCCAG CACCTAATAC CGTGCCTGGC
TTTTGGTTGA CGTTCCCAGC GGATATTAGG AGTATGGTTG TTAACTTCAC TGAACCAAAC
TTAACCAAGG CAATGCAACT CCTAGAGTCA GCTGGCTTAA TTTATAAGAG TGGTCAATGG
TACCTACCCA ATGGAACACC ACTAACACTA ACAATATACG CTAGCTCCAC AGCTCACCCA
GCATGGATTG AAGCTGCGTC AATTGATGCT GAGGCTTTAA CAGGATTCGG TATAAAGACC
ACTGTAGTCA CCATAGAGGG TAGTACTTAT AGTTCTGAGC TTGATAGTTG CCAATTACCA
GATACTGGCA CTGACTGGTT CTTCGCCGGT TCTAATAAGG GTGGTGTTAA TGAATTATGG
ATATACTATG ATGACGCACT TTACACGGCA CCAGGCCTTT TCCCATCCTA CTGTGTCCCT
GGCCACACTA CGCCTTTCGC CTACCCGATT GTTCAGAGTA ATCAGATTAC CGGCTGGTAT
TGTAAACCAT TAACCACCAA CCTGCCTATA CCGAATAACA CCATTGTAAC ATGCGTAAAC
TCAACATACG GATACATTAA CTTGAGTAAT TGGTTAGCTG CAATAGCCGC AGCTGCACCA
GGCACATCAA CATACTATGA GTTAGTTAAG GCATTCTACG CATGGTTCCA ATACTGGGTA
CCTGGAGTGG AGCAGTTCCA GTCTCTAATA GCTTACGCAT TCCCGGAGAA GATGGTTGAT
CCAGAGTGGG TTGTGACATG TATTAATTAT AGTAATCCTG AGTACACTGA GGCAGCATAT
TCGCTGCTTC ATGATTGGGC TATGGGTTGG GGTTGGGATG GATTCTACCC AGGCTTCAAC
ACATGGCTAT TCATGGGTGG ATTCGCACCT CAAGGCGTGA TGCCTCCTTT GGCTGAGGCT
ATTATTAATG GTAGTCTTTG GACTAATCCG TATCTGCATC AGTGGGCTGT CTTAATAGGC
TTACCTAATC CTGATCCTCA GTTGCAGGCT TGTGTTGCAT CATACTTCCA CACAACATAC
ACACCGGTAA CAACCACTAC TTCAACCACA ACCACTACTA CAACAACCAC TACACCGGTG
ACTACTTCAA CTACTACTTC AACAACAACT AGTACCGTCA CCTCAACAAC CACTGCCGTT
GCTACTGTTA CAAGCACAGT AACCAGTACA GCCGTAAGCA CTGTAACATC AACAGCAACC
ACAACAGCAG TATCAACCGT AACCGTGACT AAACCAGTAG TATCAACAGC ATTAATAGCA
GGAATAGTAA TAATAGTAAT CGTAATAGCA GCAGTAGCAG CAATAATAAC ATTAAGAAGA
AGATAA
 
Protein sequence
MSNYILKEKG YVTVILLVTV VILTLSALKV AYAQQSYPTY NMYMAAPWYY VLLPPWPAPW 
WNPFASGNVI FQTGTWMPLA DYNDPTGQWW PVLAENWTVF PQNQTLIIYL RHNLYWFNGS
AVIPFTAWDV YAWFYISDKA FGEWEPWLMP QNADKDFIII NNYTISIHFD YWSTTGYYWL
LMSWIPNTPW PVWESVVNEL KTMNVTEAQK FGSVNITKMA IPYWGLSPYY VTKATPNYIT
VQLEPDYFQG KPLLAEWDKI LPFHTWQYYP QITINIAVSG GLTQLVSYAL SGQPYFIHTV
EAMPYSFQSK LKSAGYYILQ LPDLSIEGIA LPTYYPFNIP QVRQAFLYII NRTAAAAPWS
WPNVPVFINV PAPAPNTVPG FWLTFPADIR SMVVNFTEPN LTKAMQLLES AGLIYKSGQW
YLPNGTPLTL TIYASSTAHP AWIEAASIDA EALTGFGIKT TVVTIEGSTY SSELDSCQLP
DTGTDWFFAG SNKGGVNELW IYYDDALYTA PGLFPSYCVP GHTTPFAYPI VQSNQITGWY
CKPLTTNLPI PNNTIVTCVN STYGYINLSN WLAAIAAAAP GTSTYYELVK AFYAWFQYWV
PGVEQFQSLI AYAFPEKMVD PEWVVTCINY SNPEYTEAAY SLLHDWAMGW GWDGFYPGFN
TWLFMGGFAP QGVMPPLAEA IINGSLWTNP YLHQWAVLIG LPNPDPQLQA CVASYFHTTY
TPVTTTTSTT TTTTTTTTPV TTSTTTSTTT STVTSTTTAV ATVTSTVTST AVSTVTSTAT
TTAVSTVTVT KPVVSTALIA GIVIIVIVIA AVAAIITLRR R