Gene Bind_3814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3814 
Symbol 
ID6198005 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010580 
Strand
Start bp124252 
End bp125901 
Gene Length1650 bp 
Protein Length549 aa 
Translation table11 
GC content60% 
IMG OID641703946 
Producttransposase IS204/IS1001/IS1096/IS1165 family protein 
Protein accessionYP_001831098 
Protein GI182676951 
COG category[L] Replication, recombination and repair 
COG ID[COG3464] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGAAGA AGATCCATTT GGCCTTCGGG CTTGGCATTG CTGTCGAAAA GATCGACCAT 
CGCGAGGAAG GCTGGGTGGT TTCAGCTCTC GCGTCTGGAA CCCGCAGCTG TCCCGGCTGC
GGTGTGGTCT CAACCAGGCG GCACAGTTGG CACGTCAGAC ACCTGCAGGA TCTGCCGATC
CAGGGGATCC CTGTGACCAT CGCGCTGCAG CTGGGACGCT GGCGCTGCCG CAACGAGAGC
TGCTCGCGCA AGACTTTCGT CGAGAAGATC TCGACCGCCT TTCCGTTTGC CCGACGGACC
GCACGGGTCG GTGAGATCAT TCGCCTCTTT GGCCATGCCG CGGGAGGCCG GGTTGGTGCA
AGACTGTTGG ACCGCCTTGC CATGCCAACC AGCCATAACA CGGTCCTGAG GCATCTCAAA
CGGCATGCTT CGGCGAGCAA GCTCAAGGCT CCCCTTCGGA TTGCCGCTAT CGATGACTGG
AGCTGGCGGC GGGGCGAAAC CTACGGCACG ATCATCGTCG ATCTCGAAAG GCGGACCGTT
GTCGATGTCT TGCCCGTTCG TTCCGTCGAG AGCACGGAGC ACTGGCTCAG GCAGCATCCT
GGCATCGAGA TCGTCAGTCG GGATCGGTGT GGCCTCTATG CCCAGGCCAT TCGGCAGGGC
GCGCCCCAGG CCCAGCAGGT GACCGACCGG TTTCATCTGC TGCAGAACCT GCGGGAGGCC
ATCGAGCGCC AGATGGAACG GGTCAGCCGG TTTGCCGGTC GCTCTCTTCT GCCAGCGGGT
TCTGACGCCA AACGGGAAGC ACCCCGGCAG GCAAGCCGGG AAGCCCGATT GGCGTTATTT
CAAAATGTTC ACGAGCTACA CTCGGCCGGA ATGCCTATCA CGGCGATCAA GGACAAGACC
GGGCTTGCCC TGCACACGCT GCGCCAATGG GTCCGTCTGG ATGACTTGCC GGCGCGCCGT
CACCCCGCTC CGACCGCCAG ATCACCGGCT TCTTTCAAGG ATTTCCTGAA GCAGCAATGG
GAGGCTGGAA ACCGATGTGG ACGCCATCTT CTGCATGATC TCCGCCATCG CGGCTATACG
GGCAGCCGCT CTCACCTCTA TCACTTCATC GCGGAATGGC GGCGGCTTGA GCCGGATGAG
AGCAGGAACA TCAAGGCGTC ACCAACACCA CATACGCCAC TGGCGGAAAC GAAAGCAATT
GACCCAGTGA CGGGCTGGCA GATCTCGCCG AAAGTCGCCG CAGTGCTGTG CCTGAAGCCG
ACGCGCTTGC TGACACCCCG TCAAGCACTC AAGGTCAAAG CCCTGAAACA GGCCTCTCCA
AGCTTCGTCA CCATGCGGGC TCTAGCGATG CGGTTTCGTG GTCTTATGCG CAGCAAGGAG
CCATCAAAGC TCGAAAAATG GCTCGAGAAG GTAAGACATG CCGCCATTCT CCCTCTGCAG
CAATTTGCCA AAACGCTGAG GCGTGATCTC GCCGCTGTCC GAAACGCTAT TACTCAGCCT
TGGAGCAGTG GGCAAGCGGA AGGACAGATC AACCGCTTGA AAACACTCAA GCGGACGATG
TACGGAAGAG CTGGCAATGA GCTGCTCCGC GCTCGGATGA TGCCGTTTGA TTTCGTAAAT
GAAACTGTGA ATGGAATGCC TGATCCTTGA
 
Protein sequence
MRKKIHLAFG LGIAVEKIDH REEGWVVSAL ASGTRSCPGC GVVSTRRHSW HVRHLQDLPI 
QGIPVTIALQ LGRWRCRNES CSRKTFVEKI STAFPFARRT ARVGEIIRLF GHAAGGRVGA
RLLDRLAMPT SHNTVLRHLK RHASASKLKA PLRIAAIDDW SWRRGETYGT IIVDLERRTV
VDVLPVRSVE STEHWLRQHP GIEIVSRDRC GLYAQAIRQG APQAQQVTDR FHLLQNLREA
IERQMERVSR FAGRSLLPAG SDAKREAPRQ ASREARLALF QNVHELHSAG MPITAIKDKT
GLALHTLRQW VRLDDLPARR HPAPTARSPA SFKDFLKQQW EAGNRCGRHL LHDLRHRGYT
GSRSHLYHFI AEWRRLEPDE SRNIKASPTP HTPLAETKAI DPVTGWQISP KVAAVLCLKP
TRLLTPRQAL KVKALKQASP SFVTMRALAM RFRGLMRSKE PSKLEKWLEK VRHAAILPLQ
QFAKTLRRDL AAVRNAITQP WSSGQAEGQI NRLKTLKRTM YGRAGNELLR ARMMPFDFVN
ETVNGMPDP