Gene Cmaq_1338 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1338 
Symbol 
ID5710020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1413650 
End bp1414990 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content43% 
IMG OID641275845 
ProductABC transporter related 
Protein accessionYP_001541154 
Protein GI159041902 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1121] ABC-type Mn/Zn transport systems, ATPase component
[COG1122] ABC-type cobalt transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0193061 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones36 
Fosmid unclonability p-value0.712297 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGGTGG TGAGTGACCT ATGGGCCAGG TATGGTAATG GGGATTGGGT TCTTAAAGGA 
TTATCCCTAA GTCTTAAGGA TGGGGAGGTA GGGTTAGTGA TTGGGGACAC TGGCTCAGGG
AAGACAACAC TAGTTAGGGC GCTTACTGGT GTAATCCACT TAACCGGTGG TTTAGCTAAG
GGCGTTATTA AGGTTAATGG AGTTGAGTTA AGTGGTGTTG AGCCTAGGGG GAGAAGTAGG
TTAATTGGTG TTCTCTACCA GGATCCAGCA ATACACTTCA CTTACCCGAG GATTGATGAG
GATCTTGAGT TAACGGCCAT TGAGAACAAT GTAGGGGTTA AGGATTTATT GAAGACTGCT
GGCTTGAGTA ATGAGGTATT AGGTAAGTTG GTTACTGAAT TATCAATGGG TCAACTTCAG
AGACTTGCAA TAGCTAAACT GGTTACAAGG GGGGTTAAGG TAATTGTAAT GGATGAGCCA
TTAGCCCACC TGGATCAGGA TGCCGTTGAA TTACTGATTG CATTAATTAG GAGAATTAAG
AGGAGCGGTG TATCAGTGCT AATGCTTGAA CACAGGTATG AGCATATTAT TGGCTCAGTG
GATAAGATTC TTCAACTTAA TGGTGGTAAG TTAAGTGAAC CAAGTAATCT TAGTGTATTA
CATAGGCGAA GATACATTAT TAGAAGCGTA AGTGATAATA CTGATCATAG TTGGCTTAAG
TTAAGTAACG TGTGGTTTAA GTACGATGAC TCATATGTGC TCAGCGGCAT TAACTTAACT
GCATCATCAA GTAGGGTAAT CTTCATAGGG CCTAACGGCA GTGGTAAGTC AACAATACTT
AAGTTAATAC TGGGTGTGGT TAAGCCAAGT AGGGGTAAGG TTACTGTTGA TTACTCACGC
CCTGTGCTTT ACATGCCGCA GGACATAAAT GTGGTGATGT CCATGACTGA TACAGTGGGG
GAATTGTACC TTGAGTTAGC TAAGGCCGCT GGGAGGGATG CGAGTATTGA GGATCTTGAA
CGTGAATTAA AGGCGCTTGA AATTAATATT AATACTTCAG ATGACCCACT ACACCTATCG
GAGGGGCAGA AGAGGGTGCT TATGCTTATT ATGGCTAAGT TACTGAAGCC AACACTAATG
ATTATTGATG AACCAACCTC AGGGCTTTCA GATAGGTATA GGGTTGAGTT AGCTGAATTC
ATTAACCAGT CAAACCTAAG GGTGGTGATG GCTACCCAGG ACCTGCGCTT CGCATCCTTA
ATAAGGGATA GTGACGTATT CTACGTGAGG GGGCGGGAGG GGTATGTTGC TAAGGTGAGT
GTGAGCGAGC ATGTTCTATG A
 
Protein sequence
MLVVSDLWAR YGNGDWVLKG LSLSLKDGEV GLVIGDTGSG KTTLVRALTG VIHLTGGLAK 
GVIKVNGVEL SGVEPRGRSR LIGVLYQDPA IHFTYPRIDE DLELTAIENN VGVKDLLKTA
GLSNEVLGKL VTELSMGQLQ RLAIAKLVTR GVKVIVMDEP LAHLDQDAVE LLIALIRRIK
RSGVSVLMLE HRYEHIIGSV DKILQLNGGK LSEPSNLSVL HRRRYIIRSV SDNTDHSWLK
LSNVWFKYDD SYVLSGINLT ASSSRVIFIG PNGSGKSTIL KLILGVVKPS RGKVTVDYSR
PVLYMPQDIN VVMSMTDTVG ELYLELAKAA GRDASIEDLE RELKALEINI NTSDDPLHLS
EGQKRVLMLI MAKLLKPTLM IIDEPTSGLS DRYRVELAEF INQSNLRVVM ATQDLRFASL
IRDSDVFYVR GREGYVAKVS VSEHVL