Gene Clim_1030 details


Gene Information

Locus tag: Clim_1030
Symbol:
ID: 6353732
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1135937
End bp: 1137475
Gene Length: 1539 bp
Protein Length: 512 aa
Translation table: 11
GC content: 61%
IMG OID: 642668653
Product: AAA ATPase central domain protein
Protein accession: YP_001943084
Protein GI: 189346555
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0464] ATPases of the AAA+ class
TIGRFAM ID:
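
The coordinate and length fields above can be cross-checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length; both are assumptions, although they agree with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1135937
end_bp = 1137475

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1539 512
```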


Plasmid Coverage information

Num covering plasmid clones: 37
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTCTGGC TCGACGAACT CAAACTGAAC GCTGACGCGC GGGTAGCCGT CATACACCTG 
GCAACCATCG ACGAAGAGGA CGCCATGCGC GCGCTTACCG GCTGGTCGCA ATCGGGCGAG
TGGCCGAAAG GCATGGGGCT CATCACCTGG GATATCGGCG ACCAGTTCCG GCACCTGCGG
GAACCCGCGG CAACCTTCAG CAAGCTCTCG GCCACCCCTG AAACCGTGCT CGACATCATC
GACGACTACA AGGGATCGGC AACATTCGTG CTCAAGGACT TCCACCATTT CTGGGAACAC
AGCCACAAGG TTTCGCGCAT GCTGCGGAAC CTCGCATCGC GTCTTCCGTT CCGCGCCGAA
ACGGTGAACA TCATCATTAC CAGCCCCCGG TTTGCGCTTC CCGCCGAACT CGCTCACGAC
ATTCCAACCA TCGATGTCGG CAAGCCCGAC GCCAGACAGA TGCTGGAACT GCTCGAACGC
GAAACCCGCT CGACGCGTTC TCTCGACAAC GCCACGCACG GCCTGCGCGA ACGTATGGTC
GAAAGTGCGC TCGGCCTTTC CGTCGTCGAA GCGGGCAGGG CGTTCCGCAA AGCTATCGTG
GTTGCCGGAG GAGAAGGGCT CGACGAACGG AGCGTCCGGC AGGTTTTGAA CGAAAAACGG
CACATCATCC GCGAAAGCGG CGCGCTCGAA CTCTACCCCT GTACCGGATC CATGAACAAC
GTCGGGGGAC TCGGAACCCT GAAAGCGTGG CTCGACGAAC GTCAGGAGGC CTTCAGCCAG
GATGCGCGCG AGTATGGCCT CAGCATGCCC AAAGGGGTGG CGCTCATCGG GATTCCCGGA
ACCGGAAAAA GCCTCTGCGC AAAGGTGACC GCAGGCCACT GGGGGATGAC GCTGCTGAGA
ATGGATGTCG GCGCCATCTT CAGCGGCCTG CTCGGTTCGA GCGAACAGAA CATTCGCGAA
GCCATCCGCA TTGCCGAGGT CATCGCGCCC TGCGTGCTCT GGGTTGACGA AATCGAAAAA
GCCTTTGCCG GCTCCATGGG CGACAGCGGC ACGGCAAGCC GGGTGTTTGC CACCTTCCTG
ACCTGGATGC AGGAAAAAAC CGCTCCGGTT TTTGTTTTCG CCACGGCCAA CAACGTCCGG
CGGCTTCCCC CCGAACTCCT TCGCAAAGGC CGATTTGACG AGGTTTTTTT CCTCGATCTT
CCCACCCACG GCGAGCGGAT AAAGATACTC GAAGTCCATC TCAGGGAGCG CGGCTACACC
ATGCTCTCGC AGCGGTTCAA CCTTGCCGCC GTCGCATCGG CCACCGAGGG GTTCGTCGGC
GCGGAGCTGC AGGCGCTGGT CAACGACGCC ATGTTTCCCG CATTCCGCGA CCATCGCCGG
GAGCTGGAAA CGCAGGATCT GCTCAATGCG GCCGGAGAGA TGGTTCCCCT TTCGGCATCG
CATCAGGAGC ATATCGAACA GCTGAGGACC ATGGTCACAA ACGGACAGGC GCGCAACGCT
TCCGACGACC GGAATGCTTT GGCTTCGTCG CAGAAATAA
 
Protein sequence
MVWLDELKLN ADARVAVIHL ATIDEEDAMR ALTGWSQSGE WPKGMGLITW DIGDQFRHLR 
EPAATFSKLS ATPETVLDII DDYKGSATFV LKDFHHFWEH SHKVSRMLRN LASRLPFRAE
TVNIIITSPR FALPAELAHD IPTIDVGKPD ARQMLELLER ETRSTRSLDN ATHGLRERMV
ESALGLSVVE AGRAFRKAIV VAGGEGLDER SVRQVLNEKR HIIRESGALE LYPCTGSMNN
VGGLGTLKAW LDERQEAFSQ DAREYGLSMP KGVALIGIPG TGKSLCAKVT AGHWGMTLLR
MDVGAIFSGL LGSSEQNIRE AIRIAEVIAP CVLWVDEIEK AFAGSMGDSG TASRVFATFL
TWMQEKTAPV FVFATANNVR RLPPELLRKG RFDEVFFLDL PTHGERIKIL EVHLRERGYT
MLSQRFNLAA VASATEGFVG AELQALVNDA MFPAFRDHRR ELETQDLLNA AGEMVPLSAS
HQEHIEQLRT MVTNGQARNA SDDRNALASS QK
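
The two sequence blocks above are printed in groups of ten characters, so the whitespace has to be removed before they can be used. The following is a minimal sketch of that cleanup, together with a simple translation and a GC-percentage helper. The function names (`clean`, `translate`, `gc_percent`) are hypothetical and not part of any database tool. The codon table is the standard one; translation table 11 assigns the same amino acids to codons during elongation and differs only in which codons may act as start codons, which this sketch ignores (the gene here begins with ATG).

```python
from itertools import product

# Standard codon table in TCAG order. Translation table 11 uses the same
# amino-acid assignments; alternative start codons are not handled here.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a non-empty sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
first_block = "ATGGTCTGGC TCGACGAACT CAAACTGAAC"
print(translate(first_block))  # MVWLDELKLN
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is one way to confirm that the two listings agree.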