Gene Bind_1654 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBind_1654 
Symbol 
ID6200473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBeijerinckia indica subsp. indica ATCC 9039 
KingdomBacteria 
Replicon accessionNC_010581 
Strand
Start bp1870025 
End bp1871692 
Gene Length1668 bp 
Protein Length555 aa 
Translation table11 
GC content60% 
IMG OID641705645 
Productformate--tetrahydrofolate ligase 
Protein accessionYP_001832774 
Protein GI182678628 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0498666 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCAGTG ATCTGGAAAT TGCCCGCGCC GCGAAACTTC GGCCGATTGC CACCGTCGCC 
GACGAGGCCA AGATTCCAGC CGAGGCGTTG CATTCCTATG GCCTGCATGT GGCCAAGATC
GACACGAGCC TGTTGCCGAA AAAGGACCGC CCGGCCAAGC TAGTCCTTGT GACCGCCATC
AATCCGACCC CGGCGGGCGA GGGCAAGACA ACCACCACAA TCGGACTCGG TGATGCCTTG
CGCCGTCTCG GCAAGGCTTG TGTCATCGCT TTGCGTGAAC CTTCGCTTGG TCCCTGTTTT
GGGACCAAAG GGGGGGCCAC GGGCGGCGGT TACGCGCAGA TCGTGCCGAT GGAACGCATC
AATCTGCATC TCACCGGCGA TTTCCACGCG ATCACCAGCG CGCATAATCT CCTCGCCGCC
TTGATCGACA ATCATCTTTA CTGGGGCGCC GAGCCCAAAA TCGATTCCCG CAAAGTCGCG
TGGCGTCGCG TGCTCGACAT GAACGACCGT GCCTTGCGTC AAATCGTGGT CGGTCTGGGG
GGAGGGGGCA ATGGCTACCC CCGTGAAACA GGTTTCGACA TTACCGCCGC TTCCGAGATT
ATGGCGATCT TCTGCCTTTC GAAAGATCTC GCGGATCTGC AACAAAGGCT CGCACAAATC
ATCGTCGCGC AGGATGTCAA CAAACAGCCC GTGCGTGCTG ATGCGTTGCA GGCCGTCGGC
GCCATGACAG TCCTGCTCAA GGACGCGCTC ATGCCCAATC TGGTGCAAAC GCTCGAAGGC
ACGCCCACTT TCGTTCATGG CGGTCCCTTT GCCAATATTG CCCATGGCTG CAATTCAGTG
GCGGCGACGC TGGCCGCCAT GCAACTTGGC GATTACGTCG TGACGGAAGC AGGCTTCGGC
GCCGATTTGG GGGCCGAGAA ATTTCTGGAC ATCAAATGCC GCCAGGCGGG GATCGCGCCC
TCCGCGGCGG TGATCGTTGC GACGGCCCGT GCCTTGAAAT CGCATGGCGG TGTCGCTCCG
GCCGATCTCA ATAAGGAAAA TCTCGACGCC CTCAAGGCGG GCCTCGCCAA TCTCGGGCGC
CATATCGCCA ATGTCAAAAA GTTCGGGCTG CCGGTTGTCG TGGCGATCAA TCATTTCCTT
TCGGATACAG AGGCGGAACA GGAACTGATT GCGCATACAT GCCGCGATGA ATACGGGGTC
GAGGCGATTG ATTGCCGGCA TTGGGCGGCC GGTGGCAAGG GCGCCCTGGC GCTGGCTGAA
AAGGTGATCG CCTTGGTCGA GGGTGGCACG GCCCAATTCA AGATGCTGTA TGAAGATACT
TTGCCACTCA TTGAGAAAAT GCGCCGCATC GCGCAGGAAA TCTATGGCGC AGCGGATATT
TCCCTGGACG CAAAGGCCAA GAAACAGCTT GCCGATATTG AGGCGCAGGG GTTCGGTCAT
TTCCCGGTCT GTGTCGCGAA AACCCAATAT TCCTTCGCTG CCGATCCGAA ACTACTCGGC
GCGCCAACGG GCCATATCGT ACCCATTCGC GAAGTCCGGC TCTCCGCCGG GGCCGGCTTC
GTCGTGATGA TCTGCGGTGA CATCATGACC ATGCCGGGGC TCTCCCGCCA GCCAGCGGCC
TGGAAGATCG GCCTCGATGC GCAAGGTAAT ATTGAAGGGC TGTTTTAA
 
Protein sequence
MSSDLEIARA AKLRPIATVA DEAKIPAEAL HSYGLHVAKI DTSLLPKKDR PAKLVLVTAI 
NPTPAGEGKT TTTIGLGDAL RRLGKACVIA LREPSLGPCF GTKGGATGGG YAQIVPMERI
NLHLTGDFHA ITSAHNLLAA LIDNHLYWGA EPKIDSRKVA WRRVLDMNDR ALRQIVVGLG
GGGNGYPRET GFDITAASEI MAIFCLSKDL ADLQQRLAQI IVAQDVNKQP VRADALQAVG
AMTVLLKDAL MPNLVQTLEG TPTFVHGGPF ANIAHGCNSV AATLAAMQLG DYVVTEAGFG
ADLGAEKFLD IKCRQAGIAP SAAVIVATAR ALKSHGGVAP ADLNKENLDA LKAGLANLGR
HIANVKKFGL PVVVAINHFL SDTEAEQELI AHTCRDEYGV EAIDCRHWAA GGKGALALAE
KVIALVEGGT AQFKMLYEDT LPLIEKMRRI AQEIYGAADI SLDAKAKKQL ADIEAQGFGH
FPVCVAKTQY SFAADPKLLG APTGHIVPIR EVRLSAGAGF VVMICGDIMT MPGLSRQPAA
WKIGLDAQGN IEGLF