Gene Bind_3478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3478 
Symbol 
ID6199739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp3953528 
End bp3955054 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content62% 
IMG OID641707432 
Product5'-nucleotidase domain-containing protein 
Protein accessionYP_001834524 
Protein GI182680378 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0737] 5'-nucleotidase/2',3'-cyclic phosphodiesterase and related esterases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.622587 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.226915 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATCCATC TGACCCGCCG CCAGACCCTC CTCTCCGGCC TTGCGGCCGG CACCAGCCTC 
TTCGCCCATG CCAGCAAGGC GGCGGATCAT GTTACGGCGC TGACGTTCCT GCTCGTCAAC
GATCTTTACC GGATCGATGC CGATGCGCAA GGACGCGGCG GCTTCGCTCG TCTCACGAGC
GTGGTCAAAG CCGAACGCGC GCGCGCCGCC GCCCGAGGCG CAAAACTCAT CTGCGTCCAT
GCAGGCGATG CGCTGTCGCC GAGCCTGTTA TCGAGTCTCG ATCACGGCGC GCATATGATC
ACGCTCCTCA ATGAAATCGG TTTCGACGCT TTTGTGCCCG GCAATCATGA ATTCGATTTC
GGCCCTGATA CCTATCGCCA GCGCATGCAG GAGGCGCATT TTCCAGTTCT GGCCGCCAAT
CTGCGGGAAG CCGATGGATC AGCCCTGCCG GGCCATGAAA ATGCCCTGAC TATTACCCTT
GAGGGGATGC GCATCGCTTT GATTGGCGCG GCTTACGAGG CAACGCCAAC CGTCTCGCAG
TCAGGCGCTT TGCTCTTCAG CCCGACGCTC GCGACAATCG CCGACGAGAC GCGTAAAGCG
CGCGCCCAAG GCGCCGATTT CGTCGCCGCC ATCGTTCATG CGGACAGGGC CACCGGCCGC
GCTCTTATGG ACATGAAGGG ACCGGACCTG ATTCTCTGCG GCCATAATCA CGACCTGCAC
ATCGACTTCG ATGGCCGCAC CGCCTTCATG GAATCGGCAC AGGATGCCAA TTATGTGCTG
AGCGTCGATC TCGACCTTAC CAAGAACGCG ACCGGACTCG GCTGGTGGCC GAATTTCCAT
GTGGCCGATA CACGCCAGAC CCAACCCGAT ACGGATATGG AGGGAAAGGT CCAACACCTC
CTCGCCAACC TGGCCGCGAC CCTTGCCGCG GATCTCGCCC GTACGCAATC CCCGCTCGAC
AGCCGCACGC AGATCGTGCG CGGCGAGGAA TGCACGATCG GCAATTTGTT TGCCGATGCG
ATCCGGCAAC GGACCGGCGC GGAAGCCGCG CTGATCAATG GCGGCGGCAT ACGCGGCAAT
CGCCTCTATC CGGTGGACAG CATGCTGACC CGCAAGGATA TTCTGGCCGA ACTCCCCTTC
GATAACAAAA CTTTGGTCCT GCCGATCGAG GGAAAAAGAC TGCTGCTCGC GCTCGAAAAC
GGGCTTTCGC GCGCCGAACA TCCAAGTGGG CGTTTCCCAC ACGTCTCTGG CCTCATCGTC
GAGGCCGATC TCACGCGTCC GCCGGGTTCC CGCGTGCAAC AGGTCCGCAT CAACGGAGAC
CCCCTTGCAC CGGAGCGCAT CTATCAGCTC GCCACCAATG ATTATCTGGC GCGGGGCGGC
GATGGTTATC TCATGCTGGC GGGCCGCGCC GATATCACCA CGGATTCCGG GAACCGTCTC
ATCGCCCAGG ATGTCGGGGA TTATCTCGCC TTAAAGGGGC AAGTAGCGCC GCAAATCGAG
GGCAGGATAG TTCTGCGCCG GTCCTGA
 
Protein sequence
MIHLTRRQTL LSGLAAGTSL FAHASKAADH VTALTFLLVN DLYRIDADAQ GRGGFARLTS 
VVKAERARAA ARGAKLICVH AGDALSPSLL SSLDHGAHMI TLLNEIGFDA FVPGNHEFDF
GPDTYRQRMQ EAHFPVLAAN LREADGSALP GHENALTITL EGMRIALIGA AYEATPTVSQ
SGALLFSPTL ATIADETRKA RAQGADFVAA IVHADRATGR ALMDMKGPDL ILCGHNHDLH
IDFDGRTAFM ESAQDANYVL SVDLDLTKNA TGLGWWPNFH VADTRQTQPD TDMEGKVQHL
LANLAATLAA DLARTQSPLD SRTQIVRGEE CTIGNLFADA IRQRTGAEAA LINGGGIRGN
RLYPVDSMLT RKDILAELPF DNKTLVLPIE GKRLLLALEN GLSRAEHPSG RFPHVSGLIV
EADLTRPPGS RVQQVRINGD PLAPERIYQL ATNDYLARGG DGYLMLAGRA DITTDSGNRL
IAQDVGDYLA LKGQVAPQIE GRIVLRRS