Gene Cmaq_1010 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCmaq_1010 
Symbol 
ID5709406 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCaldivirga maquilingensis IC-167 
KingdomArchaea 
Replicon accessionNC_009954 
Strand
Start bp1060214 
End bp1061362 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content48% 
IMG OID641275511 
Producthypothetical protein 
Protein accessionYP_001540831 
Protein GI159041579 
COG category[C] Energy production and conversion 
COG ID[COG1139] Uncharacterized conserved protein containing a ferredoxin-like domain 
TIGRFAM ID[TIGR00273] iron-sulfur cluster-binding protein 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.12516 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGGTATG ATGAGGCAAT TAGGAGGGCT GTGGAGGCTA ATGTGCCTAG AACCATGAGT 
ATTCTCGATA AGTACCCATA CGTAACCGAG TTGGCTAAGG AGCTTAGGAA GGCTAAGGAG
GAGGTTATCA GAAACCTGGA GTACTACGTG GATAAGGCTA TGAAATCCAT TCAAGCCATA
GGGGCTAAGG CGTACTTCGC AAGGGATGGC GATGAAGCCA GGAGAATAAT AGGTAATATT
GTGGGTAAGG GTAATGTTAT AGTCTTAGGT AAGACAATGG TTGGCAGTGA GATTGGGCTT
AGGGAATACT TAATCAGCAT TGGTAATGAG GTTTGGGAAA CCGACTTAGG TGAATTCCTA
ATACAGTTAA CCGGGGATAA GCCAACCCAC ATAGTTGCCC CAGCCCTACA CATGACCAGG
GAGAGGGCTG CCAGGGTTAT TAAAGAGAAG TTAGGCATAG ATGTTAAGGC CGATCCATCT
GAAATAGCCC AGACAGCTAG AAGATTCCTG AGGGATAAGT TCTTTAAGGC TAACTTTGGG
ATAACCGGAG CAAACGCAGT GGCCGCCGAC ACTGGGGCTG TGCTGCTTAT TGAGAATGAG
GGTAACATAA GGTTCACCAC AGTGTCACCG CCGGTTCACA TAGTCTTAAC AGGTATTGAT
AAGATAGTCC CAACACTGCA TCACGCATTC ATGGAGGTTA TGGTTCAAAG CGCCTACGCT
GGACTCTACC CCCCAACTTA CGTTAACCTA GTGGCTGGAC CATCAACAAC AGCTGATGTT
GAGCAGACTA GGGTTTCCCC CTCACACGGG CCCAGGGAGG TTCACGTAAT CCTCCTCGAT
AATGGTAGGT TAAGGGCCTC TAAGGATGAT TTACTTTGGG AAGCACTACT GTGCATTAGA
TGCGGTAGAT GCCACTTCCA TTGCCCAGTC TACAGGGCTT TAGATGGTTC ATGGGGTGAG
TCACCCTACG TGGGGCCAAT GGGGGTTATG TGGACTGCTG TGGTTTATGG AATTGAGAAG
GCTGGTCCAC ACGCAATGTT ATGCATGCAT GCTGGTACAT GCCGTGAAGC ATGCCCAATG
AAGATCAACA TCCCTGAAGT AATACAGGGT ATTAAGGCAA GGTACACTAA ACTAGTGGCT
AAGCGGTAA
 
Protein sequence
MGYDEAIRRA VEANVPRTMS ILDKYPYVTE LAKELRKAKE EVIRNLEYYV DKAMKSIQAI 
GAKAYFARDG DEARRIIGNI VGKGNVIVLG KTMVGSEIGL REYLISIGNE VWETDLGEFL
IQLTGDKPTH IVAPALHMTR ERAARVIKEK LGIDVKADPS EIAQTARRFL RDKFFKANFG
ITGANAVAAD TGAVLLIENE GNIRFTTVSP PVHIVLTGID KIVPTLHHAF MEVMVQSAYA
GLYPPTYVNL VAGPSTTADV EQTRVSPSHG PREVHVILLD NGRLRASKDD LLWEALLCIR
CGRCHFHCPV YRALDGSWGE SPYVGPMGVM WTAVVYGIEK AGPHAMLCMH AGTCREACPM
KINIPEVIQG IKARYTKLVA KR