Gene Clim_0786 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0786 
Symbol 
ID6353856 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp860463 
End bp861830 
Gene Length1368 bp 
Protein Length455 aa 
Translation table11 
GC content48% 
IMG OID642668410 
Producttransposase IS4 family protein 
Protein accessionYP_001942845 
Protein GI189346316 
COG category[L] Replication, recombination and repair 
COG ID[COG3666] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.980261 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATTCAG ACTTTCTTGT CGCCGATCGA GATTCGCTTT ATCTTTTGCC ACCATCCGTT 
CAGGAGTGGT TGCCGGCCAA TCATCTTGCA CGCTTTATTG TTGATATTGT TGCGCAGCTT
GATCTTACTC CATTGAGAGA CGCCTATGCA GGCAGAGGTT GCAAAGCCTA TGATCCGGCG
ATGTTGCTGA CTCTCTTGTT TTACGGTTAT GCCACTGGAA CGTTTTCAAG CAGAAAGCTG
GAACTTGCCA CTTATGAATC CATAGCGGTC CGCTATATCA CCGGAAACAG TCATCCAGAT
CATGACACCA TAGCAAATTT TCGGAACCGC TTTCTCGGCG AACTGAAACC CTTTTTTATC
CAGATTTTGA GTCTTGCTCA CGAAATGAAC ATCCTCAAGA TCGGCAAGAT CAGTATTGAT
GGCACCAAAA TCAAAGCCAA TGCTTCCAAA CATCAGGCAC TGAGTTGGGG GCATGCTTGC
AAAATCGAAA AGCAGTTGAA AGAGGAAGTT GACTCCCTGC TTCGTCAGGC TGAACTTGCA
GACCAGTCAA CAATTCCTGA CGGGATGAGT ATTCCTGCAG AGCTTGAGCG CCGTGAAAAA
AGGCTTGAAG CTATTGCCAA GGCAAAGTGT GAGATTGAAC GCCGGGCTGA GGAGCGATAC
GAAAAAGAAA AAGCTGAACA TGTGGCAAAA CTGGCAGAGC GTGAACGGAA AGCGCAAGAG
AGCGGCAAGA AAAGTAGAGG CAAAATACCG AAAGCACCAG AGCCGGGAGT GAAGGATCGC
GACCAGGTTA ATTTGACGGA CGAGGAGTCA AGAATCATGC CGGTATCGGG CGGTGGATTC
ATGCAGGCCT ATAACGCTCA GGCGAGTGTT GATCTCGACA CCATGCTGAT GGTCGCAGTT
CACGTCACCC AACATACAAA CGATAAACTT GAGCTCCAGC CAGCTTTTGA TGAGCTAAAA
AAACTACCTG CAAAGCTGGG AAAAGTAGAG GAGGCAACTG CTGACGCCGG ATATTTCAGC
GAAAAGAATG TTGAGCTTTG TGAAACAGAA GAGATAGTCC CCTACATTGC GGCTGGACGA
GAATCACATA ATCAGTCACT TGCTGACCGA TTCAGCGAAC CAGAGCCATT AGCAAAAGAT
GCTGATGCGG TAACAACAAT GAAACACCGC TTGAAAACAA AGGATGGAAA GGCATTCTAT
GCACGCCGCA AATGTACGGT TGAACCGGTG TTTGGTGTCA TAAAATCAGT GCTGGGCTTC
CGGCAGTTCT TGCTCAGAGG CATAGAAAAT GTCACAGGGG AATGGAATCT TGTCGGTATT
GCGTGGAATC TGAAGCGATT GAATGTGTTA CGCCAGATAA TGGCCTGA
 
Protein sequence
MHSDFLVADR DSLYLLPPSV QEWLPANHLA RFIVDIVAQL DLTPLRDAYA GRGCKAYDPA 
MLLTLLFYGY ATGTFSSRKL ELATYESIAV RYITGNSHPD HDTIANFRNR FLGELKPFFI
QILSLAHEMN ILKIGKISID GTKIKANASK HQALSWGHAC KIEKQLKEEV DSLLRQAELA
DQSTIPDGMS IPAELERREK RLEAIAKAKC EIERRAEERY EKEKAEHVAK LAERERKAQE
SGKKSRGKIP KAPEPGVKDR DQVNLTDEES RIMPVSGGGF MQAYNAQASV DLDTMLMVAV
HVTQHTNDKL ELQPAFDELK KLPAKLGKVE EATADAGYFS EKNVELCETE EIVPYIAAGR
ESHNQSLADR FSEPEPLAKD ADAVTTMKHR LKTKDGKAFY ARRKCTVEPV FGVIKSVLGF
RQFLLRGIEN VTGEWNLVGI AWNLKRLNVL RQIMA