Gene BAS1959 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBAS1959 
Symbol 
ID2851348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus anthracis str. Sterne 
KingdomBacteria 
Replicon accessionNC_005945 
Strand
Start bp1965290 
End bp1966978 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content38% 
IMG OID637505209 
Productformate--tetrahydrofolate ligase 
Protein accessionYP_028222 
Protein GI49184970 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG2759] Formyltetrahydrofolate synthetase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACTA CTACAACAGT TAAATCCGAT ATTGAAATCG CACAAGAAGC GAATATGAAA 
AAGATTCAAG AAATTGCAGC TGATTTAAAT ATTTTAGAAG ATGAATTAGA GCCATACGGG
CATTATAAAG GTAAGTTATC TCTTGATATT TTTAAGCGCT TACAAAATGA GAAAGACGGT
AAAGTTGTTT TAGTAACAGC GATTAACCCA ACTCCAGCTG GAGAAGGTAA ATCAACAGTA
ACAGTTGGTT TAGGTCAAGC TTTTAATAAA ATTGGTAAGA AAACAGTAAT TGCACTTCGC
GAACCATCTC TTGGACCAAC GATGGGGCTA AAAGGCGGAG CAGCAGGTGG TGGTTTTTCA
CAGGTTGTAC CAATGGAAGA CATTAACCTT CACTTTACTG GAGATATCCA TGCGATCACA
ACTGCTAATA ACGCGTTAGC CGCGTTTATT GATAATCATA TCCAACAAGG AAATACACTT
GGAATTGATA CGCGTAAAAT CGTTTGGAAA CGTTGTGTTG ACTTAAATGA TCGTGCCCTT
CGTAACGTAG TAATTGGTCT TGGTGGACCG GTTCAAGGTG TACCACGTGA AGACGGTTTT
GATATTACAG TAGCATCTGA AATTATGGCC GTATTCTGCC TTGCGACAGA TATTCAAGAT
TTAAAAGCAC GTCTATCACG CATCGTAGTT GCTTATAATT TTGCAAATCA ACCTGTAACG
GTTAAAGATT TAGGTGTAGA AGGTGCGTTA ACATTATTAT TAAAAGATGC ATTAAAGCCA
AACTTAGTGC AAACGTTAGA AAATACACCA GCTATCATTC ATGGCGGACC ATTTGCGAAT
ATCGCTCATG GTTGTAACAG TGTTATCGCT ACAACAATGG CAGCAAAATT AGGTGATTAT
GTTATTACAG AAGCTGGATT TGGTGCAGAT TTAGGTGCTG AGAAGTTTTT AGATATTAAA
GCTCGTGCAG CTGGCATTAA ACCAGAAGCA GTTGTTATTG TTGCGACGAT TCGTGCGCTT
AAAATGCATG GTGGCGTAGC AAAAGATCAA TTAAAAGAAG AAAATGTAGA TGCATTAGCA
AAAGGTATGG AGAACTTACA GAAGCACGTT GAAACAATTC AAAGCTTCGG TGTGCCTTTC
GTAATTGCAA TTAATAAATT CATTACAGAT ACAGATGCAG AAGTTGCATA CTTACAAGAA
TGGTGTAATG AGCGTGGCTA TGCGGTATCC TTAACAGAAG TTTGGGAAAA AGGTGGCCAA
GGCGGAGTTG ACCTTGCTGA AAAAGTGTTA AAAGAAATTG AAAAAGGTGA AAACAACTAC
GCACCACTTT ATGAATTAGA ATTACCATTA GAAGAGAAAA TTCGTACAAT TGCTCAAAAA
GTGTATGGCG CAAAAGACAT TGAATTTGCT CCGAAAGCAC GTAAGCAATT AGCTCAATAT
GAAGGCGAAG GTTGGAGTAA CCTACCAATT TGTATGGCGA AAACACAATA CTCTCTTTCT
GACGATGCAA CGAAATTAGG TCGTCCATCT GACTTTATCG TTACAATTCG TGAGCTAAAA
CCATCTATTG GTGCAGGCTT TATCGTTGCG TTAACAGGAA CAATGTTAAC AATGCCAGGC
CTTCCAAAAC AACCAGCAGC ACTACAAATG GATGTAAATG AAGATGGAAA AGCAGTAGGT
TTATTCTAA
 
Protein sequence
MTTTTTVKSD IEIAQEANMK KIQEIAADLN ILEDELEPYG HYKGKLSLDI FKRLQNEKDG 
KVVLVTAINP TPAGEGKSTV TVGLGQAFNK IGKKTVIALR EPSLGPTMGL KGGAAGGGFS
QVVPMEDINL HFTGDIHAIT TANNALAAFI DNHIQQGNTL GIDTRKIVWK RCVDLNDRAL
RNVVIGLGGP VQGVPREDGF DITVASEIMA VFCLATDIQD LKARLSRIVV AYNFANQPVT
VKDLGVEGAL TLLLKDALKP NLVQTLENTP AIIHGGPFAN IAHGCNSVIA TTMAAKLGDY
VITEAGFGAD LGAEKFLDIK ARAAGIKPEA VVIVATIRAL KMHGGVAKDQ LKEENVDALA
KGMENLQKHV ETIQSFGVPF VIAINKFITD TDAEVAYLQE WCNERGYAVS LTEVWEKGGQ
GGVDLAEKVL KEIEKGENNY APLYELELPL EEKIRTIAQK VYGAKDIEFA PKARKQLAQY
EGEGWSNLPI CMAKTQYSLS DDATKLGRPS DFIVTIRELK PSIGAGFIVA LTGTMLTMPG
LPKQPAALQM DVNEDGKAVG LF