Gene Bind_3371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3371 
Symbol 
ID6201634 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3826120 
End bp3828057 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content59% 
IMG OID641707317 
Producthypothetical protein 
Protein accessionYP_001834417 
Protein GI182680271 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.289065 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTGATT ATCTTCCTGC TCCTTCGACA ACTGATTACA TTACATATAC GGGCGTTGGC 
ATGGTGCCCG CCTCCACTGT CACAGTTTAC AATGGCAGTA CCGAGGCAGG CACGGCGACC
GTTGCGGCGG ATGGCACATG GTCCTTTACA TTCTCGGTAA CACCGGCCAG CGGGTCGGAA
ATTTATTACA CAGGTGAGTC CACAAGCTCG TCCATCACTG TCCCGTCCTT TGCGACGCTT
GTCCCGCTTT CACTCTCCCC GCTCACGGTG CAAGCGGGTG GCACATTCAG CGGCACAATC
ACAGGTCAAA CAGCTGGCTC GACGATTGCC GCCCTCTCGA CGGATGGGAC GGCTCTGACG
GTCAGCGGCG ACAATGTGAC CGGGACATTC TCTGCCAATG GCGTCATGAC AATCAGCCTG
ACCGAGATGC TCGCGGGGGC CACGAATACA CCCCATCTCT CGACCACAGA GATGACTGTC
TCGAATAATA CGTCTCAGGA CCCGGCATTC AGCCTCGCGC GGACCGTCCT TGTCGCGAAT
GGCGGCACGA GCCTCAGTGC CAATGGGTTC TCATTTAAGG CGGTCACGAT TTCCAATGTT
TCCGGTGTTT ATGCCGATGC CTATTACATG CGCGGCTATA TGACCCCGCT CATGTCTCTC
ATGGGCTGGA ACTATGAGAA CTCGCGCGTC TGGGCCATCG GTGGCAATAC GCTTGCTCAG
GTCCTCTCCA ACTACAGCGC AACGGGTGTC ACGCCGGCCA ATACGCGGTG GGGCACGCTC
GGATCGGGTG GCTGGTCGCC GGATATCTTC TTCGCTGAAG GCGGCTCGAA CGACCTTGGC
AACACATTGA CACAGATGCA ATCCGATGCC ACGGCGGTCG ATAATTATCT GCTCGGCCTC
AGTCCCGTCC CTCGCATCCT GAACGAGTCC ATCTGGCCGC GGACCAGCTT TCCGACCGGC
ACCGATCTGA CCTTCGTCCG GTCCGAGCGG CTCCAATTCA ATCAGTGGCG TGAACAGAAA
AGCGCCTCGT TTCCAAAGTT AAAAGCGTAT AACCTTGACG CACTCATGCA GGACCCGGCA
AATCCTGGTC AGGCGAACCC ACCCTATACC TATGACGGCG TGCATCCGAA TTCCAAGGGC
GCGATCCGCG TCGCGAACGA TATGTTCTCG AAACTGTCGT CGCTCAACAT GTTGCCGGCC
CCAGTGCTTC TCCCGCTGAC CCTTGATGCG GCGGCCGATC CGACCAATCT GCTGCAGCAC
GAAGGTGCAA CCAATTCGGC GCGACAGCTT CTCGCCGGTA CATCGGGTGG TTTGACCAAT
TTCGCGACAG GGAGCGTCAT ATGCTCCGGC TTCAACGCGT CCGCCGACGC AAGTTGGGGA
ACGACGGACA CCCAGGCGCT TCCGGCTTTC TCGATGGAGG CGGATGACAA CGCGACGAAA
CTGAAGCGGA TGGTCATTAC CTATCCGACC AGGACTTCGC CGGGAACACT CGGAAATGGC
AGTGCCGACG CGGGCAAAAT CTGGGTCGTC GTCGGGACAG GCGGGACGCT GATCTCCTCC
GCACGCGCCT CAAACTATGA TGGGGCAGGG GCGAGCATCG TGGCTGGTAA TCGCTACCGG
GCTGGTGTCC GGCTCCAGAT TATCAACGGC GTCAACCTGT TCCCGGCTGC GTTCTACTTC
AGGTGGAAAG TCGATGGGGT GACGGTGATT CAGACCTCGT GGGCCTGTTC CAGCCTTTAT
GACGCCAATG ACGCCTTCCC TGACGGCACC TATGACATAT TGTCGCCGAT CTTCACGGTG
CCGGCCTTCC AATCCTCGAT CGAGTTGACC AGTCTGCAAG TTTATCTCGC CAGCGTCCCC
GGAGCCCCGG TCGGTGGCAC GGTCAAGCTC AGCAAATGGT ATGTGCGGCC CTATCAGTTC
CTTGCCTCTT GGCCGTAA
 
Protein sequence
MTDYLPAPST TDYITYTGVG MVPASTVTVY NGSTEAGTAT VAADGTWSFT FSVTPASGSE 
IYYTGESTSS SITVPSFATL VPLSLSPLTV QAGGTFSGTI TGQTAGSTIA ALSTDGTALT
VSGDNVTGTF SANGVMTISL TEMLAGATNT PHLSTTEMTV SNNTSQDPAF SLARTVLVAN
GGTSLSANGF SFKAVTISNV SGVYADAYYM RGYMTPLMSL MGWNYENSRV WAIGGNTLAQ
VLSNYSATGV TPANTRWGTL GSGGWSPDIF FAEGGSNDLG NTLTQMQSDA TAVDNYLLGL
SPVPRILNES IWPRTSFPTG TDLTFVRSER LQFNQWREQK SASFPKLKAY NLDALMQDPA
NPGQANPPYT YDGVHPNSKG AIRVANDMFS KLSSLNMLPA PVLLPLTLDA AADPTNLLQH
EGATNSARQL LAGTSGGLTN FATGSVICSG FNASADASWG TTDTQALPAF SMEADDNATK
LKRMVITYPT RTSPGTLGNG SADAGKIWVV VGTGGTLISS ARASNYDGAG ASIVAGNRYR
AGVRLQIING VNLFPAAFYF RWKVDGVTVI QTSWACSSLY DANDAFPDGT YDILSPIFTV
PAFQSSIELT SLQVYLASVP GAPVGGTVKL SKWYVRPYQF LASWP