Gene Bind_3065 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3065 
Symbol 
ID6198159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3496542 
End bp3498746 
Gene Length2205 bp 
Protein Length734 aa 
Translation table11 
GC content53% 
IMG OID641707013 
ProductTPR repeat-containing protein 
Protein accessionYP_001834116 
Protein GI182679970 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family
[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0065343 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGAGA ATCTTAATTC ACTCCTGCAC TCACACCCTT CTCGATTCGG CGTGGGGCCT 
CAACAGCAAA TCTTGCTGCA AGCAATGACT CCCCCCATCC AGATCCGTCG CGGTCGTAAA
CATCATGCAA AAGGTGTCGA ATACGCTCGC GAAAACCGAT GGAGACAAGC CCTCCATGAA
TTCGAAGAGG CAGCGCGCCG AGCACCACAT GAACCGAATT TTCATTATTC GCACGGGGTC
GCATTATGTC ACTTCAACCG GTTTGCCGAA GCCATCAAAG CCTTTGAGCG TGAGCTTGCA
GTCATACCTG AGCATGGATC AGCGCTGACT GAATTGGGTG CCTGTTTTGC ACGAATAGGT
CGCACCCGGG AAGGCATTTC CTATCTACGG AAAGGCCTCA AACTGATGCC TTATCTGCCA
CAGGCTCAAT TTAATCTCGG ACTCGCTCTG CTGACCGAAA GCCGCCGCCA AGAGGCGATC
GAGGCATGTG ACCGGGCGAT CGAACTTGAT GCGAACTATG CGGATGTCTA TCGGCTGCGC
GGCCTAGCCT ATGCGATGAG TGATGAGGAC GAGAAGTCCT TTCATGACCT GCGTATAGCG
GCCGTTCTCG ACAACAAAAA TTACGACATT CTAGTAAAAC TCGGCACGCA GAATAGCGAA
AAATCGAATC TGCCACAAGC GAGTCGCCTC CTTGAAGTGG CAGCGAAAGT CGCTCCGAAA
ATGGCTCTGG CGCAATATGC CTGGGGCCAC TTTCTCATCG GTCATCGTAT GTACGAACTT
GGATTAAGTT TCGTCGACCG TGCTATCGAA TTAGATCCGT TGCAATGGGC ACCGTATCTT
GCACGCGCCT TGGGCTTTCT GGGGCAAGGC CGTATCGAAG AGGCAATGGG ATGCTATCGC
CGAGCAAGCG AGATAGCACC CGACAATATA GACGTTGCAG GTGCGCCACT TTTTACTTTG
CAGCATAAGC CGGGGGTTAC AGAGGCGGAT TTGTTGCAGG CCCATAAAAA ATGGGGAAAA
CTGGCTCAGC CTCACGCGTC GAAAGATAGA CTGTCTTTTA CGAACAATCC CGACCCACAA
CGGAAACCAC GGATAGGGCT GGTGTCAGGC GACATGCATC GCCATGCCGT GGCCTTCTTG
ACCCTACGGG CATTCGAGCA ACTCGCGACG CTAGGCTATG AGATTTTTTG TTATAAGACG
GACCCAAAGC GCCGTGACGA CGATTTCAGC GAGCGATATA AAGCCTTTGC GAAATCCTGG
CATGATATAT CCGAGCTCAA TGATACCGAA CTTGCCGAGT TGATTGCCGA GCAAGAAATC
GACATACTTT TCGACCTCGC TGGGCACACA AGCGGAAACC GCCTCTCCTT ATTCGCCATG
CGTGCGGCGC CCATCCAGTT GACCTGGGCC GGCTATGTCG GCACGGTTGG GCTCGATACT
TACGATGGCA TTATTGCCGA CCCGGTTGAA ATTCCGCTTG AGCATGATTC GTTCTATCTG
GAACCGGTCA TCCGTTTACC TGATTGCTAT GTTTGCTATC ATCCACCGAC TCAAGAGGTT
GATGTCGGTC CGCTCCCCTA TACTAAGACA GGAACTTTCA CGTTCAGCTG TTTCAACCGC
CCCGCGAAAC TCAATAGTGA AGTGGCTCGG GCTTGGTCCA AGATCCTGGA ACAAGTGCCA
AACGCACGTA TTCTGATGGT CTATGGCGGT TTGGGCGAGG CGAGCACACA GGAAGCCATC
TATAAAGTCC TTGAAAGCGG GGGGCTTGCA CGTGAACGCG TGGAACTTGT TGGCGAAACC
AATCAATTGA AACTTCTTGA GGCTTATGCC GAGAGAGTTG ATCTCGCGCT TGATCCCTTC
CCCTATTCAG GAGGGGTCAC GACACTCGAA GCCATGTGGA TGGGTGTCCC GACGATTACA
TGCGTCGGCG ATACTTTTGC TGGACGTCAT TCCGCGTCAC ATCTGACCGC GGCAGGGCTT
GCGGATTTCT GCACCCCTAC CGTGGAAGCC TATATTAATT TGGCCGTGGA ATGGACCAAA
CGGCCACACG AACTGGCCGC CTTGCGCGCC AGCCTGCGCG ATAAAGTCGC TGCATCGCCC
TTAAATAATC ATGTTCTCTT TGGTCATCAT CTTGATGAAG CATTAACACA ACTATGGAGA
GAATGGTGCA TGGTGCGCAT TGCGAAATCA GATCTTGAAA TCTGA
 
Protein sequence
MAENLNSLLH SHPSRFGVGP QQQILLQAMT PPIQIRRGRK HHAKGVEYAR ENRWRQALHE 
FEEAARRAPH EPNFHYSHGV ALCHFNRFAE AIKAFERELA VIPEHGSALT ELGACFARIG
RTREGISYLR KGLKLMPYLP QAQFNLGLAL LTESRRQEAI EACDRAIELD ANYADVYRLR
GLAYAMSDED EKSFHDLRIA AVLDNKNYDI LVKLGTQNSE KSNLPQASRL LEVAAKVAPK
MALAQYAWGH FLIGHRMYEL GLSFVDRAIE LDPLQWAPYL ARALGFLGQG RIEEAMGCYR
RASEIAPDNI DVAGAPLFTL QHKPGVTEAD LLQAHKKWGK LAQPHASKDR LSFTNNPDPQ
RKPRIGLVSG DMHRHAVAFL TLRAFEQLAT LGYEIFCYKT DPKRRDDDFS ERYKAFAKSW
HDISELNDTE LAELIAEQEI DILFDLAGHT SGNRLSLFAM RAAPIQLTWA GYVGTVGLDT
YDGIIADPVE IPLEHDSFYL EPVIRLPDCY VCYHPPTQEV DVGPLPYTKT GTFTFSCFNR
PAKLNSEVAR AWSKILEQVP NARILMVYGG LGEASTQEAI YKVLESGGLA RERVELVGET
NQLKLLEAYA ERVDLALDPF PYSGGVTTLE AMWMGVPTIT CVGDTFAGRH SASHLTAAGL
ADFCTPTVEA YINLAVEWTK RPHELAALRA SLRDKVAASP LNNHVLFGHH LDEALTQLWR
EWCMVRIAKS DLEI