Gene Snas_1820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_1820 
Symbol 
ID8883011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp1908884 
End bp1910581 
Gene Length1698 bp 
Protein Length565 aa 
Translation table11 
GC content71% 
IMG OID 
ProductFormate--tetrahydrofolate ligase 
Protein accessionYP_003510609 
Protein GI291299331 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.538152 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCATGC CCAGTGACCT CGACATATCC CGCGAGACCG TGACCCTGCC GCTGTCGGAC 
ATCGCCGAGC GCTGCGGGAT CCCGGCGTCG TTGACCGAAC CCTACGGCCG CTTCGCCGCC
AAGGTCTCGC TGGACGCCGT GGCGGCGCTG TCGCAGCGGC CGCGCGGCAA GTACGTGGTG
GTCACCGCGA TCACGCCCAC CCCGCTGGGG GAGGGCAAGA CCACCGTGAC CGTCGGGCTG
GGCCAGGGAC TCAACCACAT CGGCAAACGG GCGGCCATCG CGATCCGGCA GCCGTCCATG
GGCCCGACCT TCGGCATCAA GGGCGGCGCG GCCGGTGGCG GCTACGCCCA GGTGGTGCCG
ATGGAGACCA TCAACCTGCA CCTGACCGGC GACATGCACG CCGTCACCGC CGCCCACAAC
CTGCTGGCGG CCATGATCGA CAACCACCTC CACAAGGGAA ACAGTCTGGG CATCGACCCG
CACTCGGTGA CCTGGCGGCG GGTGCTGGAC GTCAACGACC GTGACCTGCG CGACATCGTC
ACCGGACTGG GGAGCCGCGC CGACGGCACC CCGCGCCAGA CCGGTTTCGA CATCACCGCC
GCCAGCGAGG TCATGGCGAT CCTGGCACTG TCCACTTCGC TGGCCGACCT GCGAAACCGC
CTGGGGCGCA TCGTCATCGG CTTCACCGCC GACGGCGCCG CCGTCACCGC CGAGGACCTC
AAAGCGGCCG GGGCGATGTG CGTGATCCTG CGCGACGCCC TCAACCCCAA CCTGATGCAG
ACCGTCGAGG GCACCCCCGC GTTCGTGCAC TGCGGACCGT TCGGGAACAT CGCGCACGGC
AACTCGTCCA TCGTGGCGGA CCTGATCGGA CTGCGCAGCG CCGACTACCT CGTCACCGAG
GCCGGTTTCG GCGCCGACAT GGGTGCCGAA CGCTTCTTCA ACATCAAGTG CCGCGCATCG
GGTCTCACCC CGGACGCGGC GGTCCTGGTG GCGACAGTCC GGGGACTCAA AGCCCACAGT
GGCCGGTACC GGATCGTCGC CGGACGGCCG CTGCCGCCCG AACTGCTGGC CGAGAACCCC
GGCGACGTCG AGGCCGGTGC CGACAACCTG CGCCGCCAGA TCGCCAACGT CCGCCGCCAC
GGCGTGTCCC CGGTGGTGGC CGTCAACGCC TTCGACACCG ACCACGACAG CGAACACCGC
GTCATCGCCG ACATCGCCGC CGCCGAGGGC GCGCACGTCG CCATCAGCAG CCACTTCACC
CGGGGCGGCA AGGGCGCCGC CGAACTGGCC GAGGCGGTGG TCGCCGCCTG CGACGAACCC
GGCCGGTTCA CGCCGCTGTA CCCCGACGAG GCCTCGCTGA CCGACAAGGT CGAGACCGTG
GCGCGCGAGA TCTACGGCGC CGACGGCGTC GACTTCGCGC CCGCCGCCGC CAAACGCCTG
GCCGCCTACG AATCCGGCGG CTACGGACGG CTGCCGGTGT GCATCGCCAA GACGCACCTG
TCGCTGTCGC ACGACCCGGC GCTCAAGGGC GCGCCGACCG GCTGGCGGCT GCCGGTGCGG
GAGGCGCGGC TGTCGGCGGG CGCGGGCTTC GTGTACCTGG TGTGCGGCGA CATGCGCACC
ATGCCGGGCC TGTCCTCGGC CCCGGCCGCC GAACGCATAG ACATCGACGA ACAAGGACGG
GTGGTGGGCT TGTCATGA
 
Protein sequence
MTMPSDLDIS RETVTLPLSD IAERCGIPAS LTEPYGRFAA KVSLDAVAAL SQRPRGKYVV 
VTAITPTPLG EGKTTVTVGL GQGLNHIGKR AAIAIRQPSM GPTFGIKGGA AGGGYAQVVP
METINLHLTG DMHAVTAAHN LLAAMIDNHL HKGNSLGIDP HSVTWRRVLD VNDRDLRDIV
TGLGSRADGT PRQTGFDITA ASEVMAILAL STSLADLRNR LGRIVIGFTA DGAAVTAEDL
KAAGAMCVIL RDALNPNLMQ TVEGTPAFVH CGPFGNIAHG NSSIVADLIG LRSADYLVTE
AGFGADMGAE RFFNIKCRAS GLTPDAAVLV ATVRGLKAHS GRYRIVAGRP LPPELLAENP
GDVEAGADNL RRQIANVRRH GVSPVVAVNA FDTDHDSEHR VIADIAAAEG AHVAISSHFT
RGGKGAAELA EAVVAACDEP GRFTPLYPDE ASLTDKVETV AREIYGADGV DFAPAAAKRL
AAYESGGYGR LPVCIAKTHL SLSHDPALKG APTGWRLPVR EARLSAGAGF VYLVCGDMRT
MPGLSSAPAA ERIDIDEQGR VVGLS