Gene Snas_3876 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSnas_3876 
Symbol 
ID8885076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStackebrandtia nassauensis DSM 44728 
KingdomBacteria 
Replicon accessionNC_013947 
Strand
Start bp4141457 
End bp4142644 
Gene Length1188 bp 
Protein Length395 aa 
Translation table11 
GC content64% 
IMG OID 
Productaminodeoxychorismate lyase 
Protein accessionYP_003512624 
Protein GI291301346 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGGACG AGAGGCCGTA CGAGACCATG GATCCCTCTC AGCCTCGCCG ACGCAGGCAC 
CGCCGTCGCC GCAAGGGCCG CACCGTTTTC GCGTTGCTGC TGGTGTTCGT GTTGCTGGGA
ACCGTTGGTG TGGTCGGGTT CGTCGGCTTC GACAAGATCA AGAACATCTT CTCCGCGCCC
GACTACGCCG GTGACGGCAA CGGCGTCAAG GTGCAGGTCG AGATCGCCGA GGGCTCGGTG
CTGTCCGACA TCGGTGACGC ACTGTACAAG AAGGACGTCG TCAAGAGCGC CAACGCCTTC
GTCAACGCCG CCGAGGCCAA CCCGAAGTCG AACCAGATCG GGCCGGGCAC CTACGCCATG
GAGAAGCAGA TGTCGGGCGA GGCGGCGCTC GAACGGATGC TGGACCCGAA GTCCCGCAAG
GTTTCCGGCG TCACGATCCG TGAGGGTCTG ACCATGTGGG GGACGTTCAA GAAGCTGTCG
GAGAACACCG GGGTGCCGCT CGAGGACTTC ACCGCCGCCG CCGAGGATCC CGAAGCGCTC
GGCATCACCA GCGACTGGTT CGAGCGCAAG GACGGCAAGG ACGTCGTCAA GTCCGTCGAG
GGTTTCCTGT CTCCCGCCAC CTACGAGTTC AAGAAGGGCG CCACCGCCGA GGAGATGCTG
AAGGCGATGG TGAGCAACTT CCTCAAGGTC ACCGACTCGA TCGGGTTCAA GGAGACCGTG
GAGGCGCAGC GCTCGAACTA CAGCCCCTAC GAGGTGCTGA TCGTGGCGTC GCTGTCGGAG
GCCGAGGCGG GTGTCCCCAA GGACCTCGGC AAGATCGCGC GGGTGGCGTA CAACCGCATG
GACGGTGAGT ACTGGTGCCA CGGCGGTCTG GAGAACTGCC TCGAGTTCGA CACCACCACG
AACTACGGCC TCATCGAGGC GGGCAAGGGC AGCAAGAACT CCAAGGACCT CACCGACGCG
GAGTTGAACG ACGAGAGCAA CAAGTGGTCG ACGCACGTGC GTGCCGGACT GCCGCCGACT
CCGATCAACA GCCCGGGCAA GAGCGCGCTG GAAGGCGCCG CCGACCCGCC GTCGGGCAAG
TGGAAGTTCT TTGTGGCCAT CGACAAGGAG GGCAACTCCG CCTTCGCCGA GACCAAAGAG
GAACACGACG CGAACGTGGA AGAGGCCAGG AAGAACGGCG TCCTGTGA
 
Protein sequence
MLDERPYETM DPSQPRRRRH RRRRKGRTVF ALLLVFVLLG TVGVVGFVGF DKIKNIFSAP 
DYAGDGNGVK VQVEIAEGSV LSDIGDALYK KDVVKSANAF VNAAEANPKS NQIGPGTYAM
EKQMSGEAAL ERMLDPKSRK VSGVTIREGL TMWGTFKKLS ENTGVPLEDF TAAAEDPEAL
GITSDWFERK DGKDVVKSVE GFLSPATYEF KKGATAEEML KAMVSNFLKV TDSIGFKETV
EAQRSNYSPY EVLIVASLSE AEAGVPKDLG KIARVAYNRM DGEYWCHGGL ENCLEFDTTT
NYGLIEAGKG SKNSKDLTDA ELNDESNKWS THVRAGLPPT PINSPGKSAL EGAADPPSGK
WKFFVAIDKE GNSAFAETKE EHDANVEEAR KNGVL