Gene Bind_2571 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_2571 
Symbol 
ID6199182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp2935271 
End bp2936311 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content56% 
IMG OID641706545 
Producttransposase IS116/IS110/IS902 family protein 
Protein accessionYP_001833659 
Protein GI182679513 
COG category[L] Replication, recombination and repair 
COG ID[COG3547] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCAAGA TTGCCGCTGA TCTCAACACC ATTACGACCG TTGGATTTGA TCTCGCAAAA 
CACGTCTTTC AGGTCCATGC AATCGATGCA GAGGGCCGAG TCGTCATAAC TAAAGCAATC
CGACGAAAGG ACGTCTTGAA GTTTTTTGCA GATTTGCCTC GCTGCGTCGT GGGTTTGGAG
GCATGTGGCT CGGCCCATCA CTGGGCACGG GAACTGATCA GGCTTGGCCA TGATGCCCGT
CTGATGCCAC CAGCCTATGT AAAGGCCTAC GTGCGCCGTC AAAAGAATGA TGCTGCTGAT
GCGGCGGCGA TTTGCGAAGC GGTAACGAGA CCGTCCATGC GCTTCGTCCC CGTACGATCC
GTCGAAAACC AAGCCACACT TATGCACCAT AAGGTGCGGG AGCTTCTGGT CGGGCAACGA
ACACAGCTCC TTAATGCCCT AAGAGGACAT CTGTCCGAGA TCGGCATAAT CGTCGCGCAG
GGACTGAACA ATGCGCGTGC GCTCGCAAGC CTCATCATTG AAGAGAACGA CATAGTTCCT
GCCATTGTGC GCTCAGCCTT GGAGCCATTG GTCCGGCAAC TGGTTCAGCT TGATGAAGAG
ATCGGGCAAA GTGACCGCGC GATCCTCGCC ATTGCCAGGT CCGATGGAAT GGCTCGCCGC
CTGATGACGG TTCCAGGAAT AGGCCCCATC ACCGCCTCGG CTTTCGCAGC GAGCGTACAG
GATGTCTCGG CTTTTTCGGG TCCGCGCGAG TTTGCTGCGT TCCTTGGCCT TACCCCAAGG
CAAGCGTCCT CTGGTGGAAA GGAGCGTTTG GGGCGGGTTT CCAAAATGGG CAACCGATAT
TTACGGAAGT TGCTTGTGGT CGGAGCACAT GCTGTTCTCT ACCATCGCCG GTCCAGCACC
GATGCACTGA GAACCTGGGC AGATAGATTG TTGGACACCA AGCCCTTTAA GCTTGTCGCC
GTGGCCATTG CGAACAAGCT TGCCCGCATC GCTTTCGCGA TCATGCGCGG CGAGGCAAGT
TATGGGAAAA TGCCAGCCTG A
 
Protein sequence
MGKIAADLNT ITTVGFDLAK HVFQVHAIDA EGRVVITKAI RRKDVLKFFA DLPRCVVGLE 
ACGSAHHWAR ELIRLGHDAR LMPPAYVKAY VRRQKNDAAD AAAICEAVTR PSMRFVPVRS
VENQATLMHH KVRELLVGQR TQLLNALRGH LSEIGIIVAQ GLNNARALAS LIIEENDIVP
AIVRSALEPL VRQLVQLDEE IGQSDRAILA IARSDGMARR LMTVPGIGPI TASAFAASVQ
DVSAFSGPRE FAAFLGLTPR QASSGGKERL GRVSKMGNRY LRKLLVVGAH AVLYHRRSST
DALRTWADRL LDTKPFKLVA VAIANKLARI AFAIMRGEAS YGKMPA