Gene Bind_3643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_3643 
Symbol 
ID6199608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp4134211 
End bp4135176 
Gene Length966 bp 
Protein Length321 aa 
Translation table11 
GC content60% 
IMG OID641707594 
ProductN-formylglutamate amidohydrolase 
Protein accessionYP_001834684 
Protein GI182680538 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3741] N-formylglutamate amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00118942 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value0.928128 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGTCGG ACGATCCTGG CAAGGACACG ACAAAAATCC AGCACGGGGA GGAGGCGCGA 
GATCTCGATT TCAACCCGCC TTTTGAGGTC CTGGAGCCTG AGACGCTGAC GTGTCCACTG
GTGTTTTCCT CACCGCATTC TGGATCTCTT TACCCACGCC GCTTCCTCGT ATCGGCTCGG
CTCGATGCCT TGACCCTGCG TCGTTCCGAG GATGTGCATG TCGATGCTTT GTTCCGCGGC
GTGGCGGGGC TTGGCGCGCC TTTGATCCGG GCCCATTTTC CGCGCGCCTT TCTGGACGTT
AATCGCGAGC CCTATGAGCT CGATCCCAAA ATGTTCGACG GCAAGTTGCC CGTTTTCTCC
AATACGAGAT CATTGCGGGT CGCCGCTGGG CTCGGCACGA TCGCCCGTGT GGTCGGTGAA
GCGCAGGAAA TCTATTTGGG ACGCTTGCCC GTCGAAGAAG CCATGTGGCG GATCGACCGT
CTCTATAAGC CTTATCATCG CGCTTTGCGG GCACTGCTCG AACGCGCCGA AAAAACCTTC
GGCGTTGCGC TTCTGGTCGA TTGCCATTCC ATGCCTTCGA ACACGCAGGC GGGCCTCGGA
CAAAGCGAGA GTCGCGGCCC CGCGAGCCGC CCCGGAAACC GGCCGGATTT TGTGCTCGGC
GACCGCTATG GGACGAGCTG CGCTGTCGAT CTTGTGGAAA CCGTGGAACA GGCCCTGCGA
CAGATGGGCT ATCAGGTCCA GCGCAATAAA CCCTATGCCG GTGGCTTTAT CACCGAGCAT
TACGGCAATC CCGCCACGCA TTTTCATGCC TTGCAGATCG AAGTGAGCCG CGGGCTCTAC
ATGGACGAGA GGACCTTCGA ACCAAGTCCA TGTTTTGCGA CTGTTGCGGA AGATCTGACC
AGAATGGCGG CGGCCCTGGC GGCGGCAATC GCCGATCGTC GGCCCCAACA GGCCGCAGCG
GAATAA
 
Protein sequence
MTSDDPGKDT TKIQHGEEAR DLDFNPPFEV LEPETLTCPL VFSSPHSGSL YPRRFLVSAR 
LDALTLRRSE DVHVDALFRG VAGLGAPLIR AHFPRAFLDV NREPYELDPK MFDGKLPVFS
NTRSLRVAAG LGTIARVVGE AQEIYLGRLP VEEAMWRIDR LYKPYHRALR ALLERAEKTF
GVALLVDCHS MPSNTQAGLG QSESRGPASR PGNRPDFVLG DRYGTSCAVD LVETVEQALR
QMGYQVQRNK PYAGGFITEH YGNPATHFHA LQIEVSRGLY MDERTFEPSP CFATVAEDLT
RMAAALAAAI ADRRPQQAAA E