Gene Arth_2901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_2901 
Symbol 
ID4444423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp3267244 
End bp3268941 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content67% 
IMG OID639690724 
Productformate--tetrahydrofolate ligase 
Protein accessionYP_832380 
Protein GI116671447 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCTGACA ACAACGTCAT GAGTGACCTG GAAATCGCCC AGCGCGCCAC CATGCGCCCC 
ATCGGTGACA TCGCCGCCGC GGCAGGCATC AACGCCGACG CCGTGGAACT CTATGGCCGG
TACAAGGCGA AAATCGATCC GGCGAAGCTT ATAGATCCGG CAGGGGCACC GAAGCGCCTG
CCGGGCAAGG TGGTGCTCGT CTCCGCGATG TCCCCCACTC CGGCGGGCGA GGGGAAGTCC
ACCACCACGG TGGGTCTGGC GGATTCCCTC GCCCGGGCGG GCCGGAACGT GATGATTGCG
CTGCGTGAGC CGTCCCTGGG TCCCGTTCTT GGGATGAAGG GCGGGGCCAC CGGCGGAGGC
TACTCCCAGG TGCTGCCCAT GGAGGAAATC AACCTGCACT TCACGGGCGA CTTCCACGCC
ATCACCTCGG CGAACAACGC GCTCATGGCC CTGGTGGACA ACCATATTTT CCAGGGCAAC
GAACTCAACA TCGATCCCCG CAGGATGACC TTCAAACGGG TCCTCGACAT GAACGACCGC
TCCCTCCGCG AAGTGATCAT CGGCCTCGGC GGTCCGGCCC AGGGCGTCCC GCGGCAGGAC
GGCTTCGACA TCACCGTGGC TTCGGAGATC ATGGCGGTCT TCTGCCTGGC CACGGACATC
GCCGATCTCC GTGCCCGGCT GGGCCGCATC ACCTTCGGCT ATACCTATGA CCGCGAACCC
GTCACGGTGG CGGACCTCGG GGTCCAGGGT GCACTGACCA TGCTGCTCAG GGATGCGATC
AAGCCCAACC TCGTGCAGAC CATCGCCGGC ACTCCGGCCC TGGTGCACGG CGGCCCGTTT
GCCAACATCG CACACGGCTG CAACTCGCTG ATCGCCACCC AGACCGCCCG CCGGCTCGCG
GACATCGTGG TCACCGAAGC CGGCTTCGGC GCAGACCTGG GCGCGGAAAA GTTCATGGAC
ATCAAGGCGA GGGTGGCAGG CGTGGCGCCG TCCGCCGTCG TGCTCGTGGC AACCGTACGG
GCGCTGAAGA TGCACGGCGG TGTGGCCAAG GACCGGCTGC AGGAGCCCAA CGTGGAGGCG
CTGGCCGCCG GATCGGCAAA TCTCCGGCGG CACATCCGCA ACGTGGAGAA GTTCGGAATC
ACTCCCGTGG TGGCCGTCAA CAAGTTCGCC ACTGACACCC CGGAGGAGTT GGACTGGCTG
CTGGAGTGGT GCGCCGGCGA AGGGGTGCAG GCCGCGGTGG CAGACGTCTG GGGTCGCGGC
GGCGGCGGCG ACGGCGGCGA CGAGCTCGCG GCCAAGGTGC TCGCGGCGCT CGAGGCGCCG
CACAGCTTCC GGCACCTCTA CCCGCTGGAG CTGTCTGTGG AGGACAAGAT CCGCACCATC
GTGCAGGAAA TGTACGGGGC CGACGGCGTG GACTTCTCCG TTCCCGCCCT CAAGCGCCTT
GCCGAAATCG AGAAGAACGG CTGGGCCGGC ATGCCCGTCT GCATGGCCAA GACCCAGTAC
TCCTTCAGTG ACGACGCCAC CCGCCTGGGC GCACCGAAAG GCTTCACGGT CCATGTACGG
GACCTCATCC CCAAGACCGG GGCGGGTTTC ATCGTGGCCC TGACCGGCGC GGTGATGACG
ATGCCGGGTC TGCCCAAGGT TCCGGCAGCC CTGCGGATGG ACGTGGACGA CACCGGCAAG
CCCCTCGGCC TCTTCTAG
 
Protein sequence
MSDNNVMSDL EIAQRATMRP IGDIAAAAGI NADAVELYGR YKAKIDPAKL IDPAGAPKRL 
PGKVVLVSAM SPTPAGEGKS TTTVGLADSL ARAGRNVMIA LREPSLGPVL GMKGGATGGG
YSQVLPMEEI NLHFTGDFHA ITSANNALMA LVDNHIFQGN ELNIDPRRMT FKRVLDMNDR
SLREVIIGLG GPAQGVPRQD GFDITVASEI MAVFCLATDI ADLRARLGRI TFGYTYDREP
VTVADLGVQG ALTMLLRDAI KPNLVQTIAG TPALVHGGPF ANIAHGCNSL IATQTARRLA
DIVVTEAGFG ADLGAEKFMD IKARVAGVAP SAVVLVATVR ALKMHGGVAK DRLQEPNVEA
LAAGSANLRR HIRNVEKFGI TPVVAVNKFA TDTPEELDWL LEWCAGEGVQ AAVADVWGRG
GGGDGGDELA AKVLAALEAP HSFRHLYPLE LSVEDKIRTI VQEMYGADGV DFSVPALKRL
AEIEKNGWAG MPVCMAKTQY SFSDDATRLG APKGFTVHVR DLIPKTGAGF IVALTGAVMT
MPGLPKVPAA LRMDVDDTGK PLGLF