Gene Clim_1571 details

Gene Information

Locus tag: Clim_1571
Symbol:
ID: 6354219
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1688873
End bp: 1690414
Gene Length: 1542 bp
Protein Length: 513 aa
Translation table: 11
GC content: 49%
IMG OID: 642669175
Product: transposase IS4 family protein
Protein accession: YP_001943597
Protein GI: 189347068
COG category: [L] Replication, recombination and repair
COG ID: [COG3666] Transposase and inactivated derivatives
TIGRFAM ID:
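
The Start bp, End bp, Gene Length, and Protein Length fields are related by simple arithmetic. The sketch below checks that relationship under two assumptions not stated in the record: the coordinates are 1-based and inclusive, and the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1688873
end_bp = 1690414

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: a single unspliced CDS whose final codon is a stop codon,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1542
print(protein_length)  # 513
```

Both results agree with the Gene Length (1542 bp) and Protein Length (513 aa) fields.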


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.0160489
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGTATCG ATTTTATCAG AGCTGATCGT GATACCCCGT TTTTGTTTCC TCCATCAGTG 
CAGGAATGGC TCCCGGCAGA TCATTTGGCC CGGTTTGTCG TTGATATTGT CAGCCAACTC
AACCTGTCGT CACTCAAGGA CGTCTATGCC GGTAGAGGAT CAAGGCCATA TGATCCGGCC
ATGCTGCTCT CCTTGTTGTT TTATGGCTAT GCGACTGGCG TGTTTTCCAG CCGGAAGCTT
GAACAGGCAA CCTATGATTC CGTAGCGTTC CGTTATATCA CCGGTAATCA GCACCCTGAT
CACGATACTA TCGCCATGTT CCGGAAGCGG TTTTCAGGAG AACTCAAGGA TTTGTTTCAC
CAGATTCTCA TCATTGCTCA TGAGATGGGG ATCCTGAAAA TCGGGACGGT AAGTCTCGAT
GGCACGAAAG TCAACGCCAA TGCATCGAAG CATCAGGCGC TGAGCTGGGA TCATGCCGAT
AAAATTGCCA GGCAATTGCA AGAAGAGATT GACCAGTTAT GGTTATTGGC AGAACAAGCT
GATCAGTCCG TGATTCCCGA TGGCATGAAA ATACCCGAGG AGCTTGAGCG GCGGGAAACC
AGACTTGCCA GCATAATTGA AGCCAAACGG AAAATAGAAG CGCGAGCCGA TGAACGCTAT
GCAGAAGAAA AGCAAGTATA CGAGAACAAG TTGGCAGAAC GGGAAAAGAG AGAAAAAGAA
CGGGGCAAAA AATCCGGAGG CAAGCCTCCC AAACCGCCGG AACCGGGCCC GAAACCACAC
GATCAAGTCA ATCTGACCGA TGAAGAATCA CGGATCATGC CTGCAAGTGG CGGCGGTTTT
CTGCAGGCGT TCAATGCCCA GGCATGCGTT GATATCGCAA CCTTGCTTAT TGTGGCAGCA
CACACGACAC AACAGCCCAA CGATAAAAAA CAGATCGAGC CGGCTCTCGA AGCATTGGGA
AACTTGCCCG AAGAACTTGG TCAAGTCAAC GAACTGCTTG CCGATACCGG ATATTACAGT
CAGGCAAATG TCGAGGCCTG TGAAGCCGCC GGAATCAATC CACTTATAGC GATATCGCGA
GAATCGCATA ATCAGAAACT TGAAGATCGA TTCAGTGAAC CAGAACCAGT ATCCGAGACG
GCTGACGGTA TCACCAAAAT GAGGTACCGG TTGAAGAGCA AAGAAGGTAA GGCACTGTAT
GCCAAACGGA AGTGCACTGT GGAACCGGTA TTCGGTATCA TCAAATCGGC CCTTGGGTAT
CGCCAGTTTT TACGCAGAGG CTTTGAGAAT GTCAATGCGG AATGGACACT GGTAAGCATG
GCATGGAATC TGAAACGCAT GCACGTTTTG ACCAAACCAC GCATTAAAAA CCCGGTTGTT
GCAGCCTGCA AAGCGAAATC GGATGCTTAT AGCGACCGAT TATCCGAAAT CTGGCGTGTT
AAGCTTTTCG AAATGCATCA TGTGGCAAAA ATAGCCACTA TACAGGCTAT TGAAACCATT
TCTTCGATGA AATGTCAATT TGTCAGTCCG ACAGGCTGCT AG
 
Protein sequence
MSIDFIRADR DTPFLFPPSV QEWLPADHLA RFVVDIVSQL NLSSLKDVYA GRGSRPYDPA 
MLLSLLFYGY ATGVFSSRKL EQATYDSVAF RYITGNQHPD HDTIAMFRKR FSGELKDLFH
QILIIAHEMG ILKIGTVSLD GTKVNANASK HQALSWDHAD KIARQLQEEI DQLWLLAEQA
DQSVIPDGMK IPEELERRET RLASIIEAKR KIEARADERY AEEKQVYENK LAEREKREKE
RGKKSGGKPP KPPEPGPKPH DQVNLTDEES RIMPASGGGF LQAFNAQACV DIATLLIVAA
HTTQQPNDKK QIEPALEALG NLPEELGQVN ELLADTGYYS QANVEACEAA GINPLIAISR
ESHNQKLEDR FSEPEPVSET ADGITKMRYR LKSKEGKALY AKRKCTVEPV FGIIKSALGY
RQFLRRGFEN VNAEWTLVSM AWNLKRMHVL TKPRIKNPVV AACKAKSDAY SDRLSEIWRV
KLFEMHHVAK IATIQAIETI SSMKCQFVSP TGC
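
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that step, with a GC-percentage helper of the kind that could be used to check the GC content field. The function names `join_blocks` and `gc_percent` are hypothetical and are not part of any database tooling.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# Final line of the gene sequence listed above.
last_line = "TCTTCGATGA AATGTCAATT TGTCAGTCCG ACAGGCTGCT AG"
tail = join_blocks(last_line)

print(len(tail))   # 42
print(tail[-3:])   # TAG, a stop codon in translation table 11
```

Passing the full gene sequence through `join_blocks` and then `gc_percent` would give a value to compare against the GC content field. That result is not shown here because it was not computed for this record.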