Gene Noca_4561 details

Gene Information

Locus tag: Noca_4561
Symbol:
ID: 4597080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardioides sp. JS614
Kingdom: Bacteria
Replicon accession: NC_008699
Strand:
Start bp: 4826186
End bp: 4827865
Gene Length: 1680 bp
Protein Length: 559 aa
Translation table: 11
GC content: 72%
IMG OID: 639779172
Product: formate-tetrahydrofolate ligase
Protein accession: YP_925745
Protein GI: 119718780
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG2759] Formyltetrahydrofolate synthetase
TIGRFAM ID:
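
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not say so explicitly, but the numbers agree under that reading) and that the CDS ends in a single stop codon. The function names are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4826186
END_BP = 4827865


def gene_length(start, end):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length):
    """Residue count of a complete CDS: number of codons minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


def gc_fraction(seq):
    """Fraction of G and C bases; spaces and line breaks are ignored."""
    seq = "".join(seq.split()).upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


print(gene_length(START_BP, END_BP))  # 1680
print(protein_length(1680))           # 559
```

Passing the full gene sequence from the Sequence section to `gc_fraction` gives a value that can be compared with the GC content field.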


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGCTGTCCG ACATCGAGAT CGCCGGGGCC GCCACGCTGC GGCCGATCAC CGAGGTAGCG 
ACCGAGTCGC TCGGCATCGG GGCCGAGCAC CTGGTGCCGT ACGGCCACTA CAAGGCGAAG
GTCGGCATCA CCTACCTGAA CTCGCTCGCC GACCGCCCGC TCGGCCGGCT GATCCTGGTG
ACGGCGCTCT CGCCGACCCC GCCGGGCGAG GGGAAGACGA CCACGTCGGT GGGGCTCACC
GACGCGCTGC ACGGGCTCGG CAAGCGGGCC ATCGCGTGCC TGCGCGAGCC GTCGATGGGT
CCGGTCTTCG GTCTCAAGGG CGGGGCCGCC GGCGGCGGCT ACAGCCAGGT GGTGCCGATG
ACCGACATCA ACCTGCACTT CACCGGGGAC TTCGCCGCGA TCGCCGCGGC CAACAACCTG
CTCGCCGCGC TGATCGACAA CCACGTGCAC CACGGCAACG AGCTCGACAT CGACGTGCGA
AGCGTGACCT GGAAGCGGGT GCTCGACACC AACGACCGCG CGCTGCGCGA GGTGGTCGTC
GGGCTGGGCG GCCCGCCCAA CGGGTTCCCC CGCCAGGACG GCTTCGACAT CGTGGTCGCC
TCGGAGCTGA TGGCGATCTT CTGCCTCACC GAGTCCTGGG CCGACCTGAA GCGCCGCATC
GGCGACATCG TGATCGGCTA CTCCCGTGCC GGCGCGCCGG TCACCGCCCG CGACCTCGGC
GCCGACGGCG CGATGGCGGT GCTGCTGCGC GACGCGATCG CGCCGAACCT CGTGCAGACC
CTCGAGGGCG CACCGGCGCT GGTCCACGGC GGGCCGTTCG CCAACATCGC ACACGGCTGC
AGCTCGGTGA TGGCGACGCG GGCCGGCCTG CGGCTGGCCG ACTACGTCGT CACCGAGGCG
GGGTTCGGGG CCGACCTCGG CGCGGAGAAG TTCATCGACA TCAAGTGCCG GATGTCCGGG
ATGCGCCCCG ACGTCGCGGT CGTCGTCGCG ACGGTGCGGG CCCTGAAGTA CCACGGCGGC
GTGGCCCTGG CCGACCTGGA CCGCGAAGAC CTGGGCGCGG TCGAGGCCGG GATGGACAAC
CTGCGTCGCC ACCTGGACAA CCTGCGGCAC CTGAACGGCG TCCCGTGCGT GGTCGCGGTC
AACCGGTTCC CCACGGACAC CGACCTCGAG GTGGTCAGGG TCGTCGAGCT CGCCGCGTCG
TACGGCGTCC CCGCCTATCA GGCCACCCAC TTCACCGACG GCGGCATCGG GGCGCAGGAC
CTCGCCAAGG GCGTCCTGCA GGCGCTCGAG GAGCCGGCGC GCGACGAGTT CTCCTTCACC
TACCCCGACG AGCTGTCCCT GACCGAGAAG GTCGAGGCGG TCGCGACCCG GGTGTACGGC
GCCGGCCAGG TCACCTGGGA CGGCAAGGCG CGCAAGCGGC TGGCGCGCAT CGAGCGCGAC
GGGTACGGCA CGCTGCCGGT CTGCGTGGCG AAGACGCAGT ACTCGTTCTC GACCGACCCG
GGCCTCCTCG GGGCGCCCAC CGGTCACGAG CTCCGGGTCC GCGAGGTCCG GCTGTCGGCG
GGCGCGGGCT TCGTGGTGGT GATCTGCGGC GACATGATGA CCATGCCCGG CCTGCCCACG
CGCCCGGCCG CGACCCGGAT CGACCTCGCC GACGACGGCA CGATCATCGG GCTGTCCTAG
 
Protein sequence
MLSDIEIAGA ATLRPITEVA TESLGIGAEH LVPYGHYKAK VGITYLNSLA DRPLGRLILV 
TALSPTPPGE GKTTTSVGLT DALHGLGKRA IACLREPSMG PVFGLKGGAA GGGYSQVVPM
TDINLHFTGD FAAIAAANNL LAALIDNHVH HGNELDIDVR SVTWKRVLDT NDRALREVVV
GLGGPPNGFP RQDGFDIVVA SELMAIFCLT ESWADLKRRI GDIVIGYSRA GAPVTARDLG
ADGAMAVLLR DAIAPNLVQT LEGAPALVHG GPFANIAHGC SSVMATRAGL RLADYVVTEA
GFGADLGAEK FIDIKCRMSG MRPDVAVVVA TVRALKYHGG VALADLDRED LGAVEAGMDN
LRRHLDNLRH LNGVPCVVAV NRFPTDTDLE VVRVVELAAS YGVPAYQATH FTDGGIGAQD
LAKGVLQALE EPARDEFSFT YPDELSLTEK VEAVATRVYG AGQVTWDGKA RKRLARIERD
GYGTLPVCVA KTQYSFSTDP GLLGAPTGHE LRVREVRLSA GAGFVVVICG DMMTMPGLPT
RPAATRIDLA DDGTIIGLS
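
The gene sequence begins with GTG while the protein begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is an accepted start codon, and an initiator codon is translated as methionine. The following is a minimal sketch of that translation, not the pipeline that produced this record; `translate_cds` is a hypothetical helper name, and the start-codon set is the one listed for table 11 in the NCBI genetic code tables.

```python
# Table 11 assigns the same amino acids as the standard code;
# it differs in which codons may act as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna):
    """Translate a CDS; a valid start codon in first position becomes M."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGCTGTCCG ACATCGAGAT CGCCGGGGCC"))  # MLSDIEIAGA
```

The output matches the first ten residues of the protein sequence. Feeding in the whole gene sequence gives a translation that can be compared with the full protein listing in the same way.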