Gene Sare_4520 details


Gene Information

Locus tag: Sare_4520
Symbol:
ID: 5706010
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salinispora arenicola CNS-205
Kingdom: Bacteria
Replicon accession: NC_009953
Strand:
Start bp: 5107956
End bp: 5109290
Gene Length: 1335 bp
Protein Length: 444 aa
Translation table: 11
GC content: 73%
IMG OID: 641273934
Product: glutamate-1-semialdehyde aminotransferase
Protein accession: YP_001539283
Protein GI: 159040030
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG0001] Glutamate-1-semialdehyde aminotransferase
TIGRFAM ID: [TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase
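
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 5107956
end_bp = 5109290
gene_length_bp = 1335
protein_length_aa = 444

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both the start and the end position.
span = end_bp - start_bp + 1

# A CDS is made of whole codons. Assumption: the final codon is a
# stop codon and contributes no amino acid to the protein.
codons, remainder = divmod(span, 3)
coding_codons = codons - 1

print("span matches gene length:", span == gene_length_bp)
print("span is a multiple of 3:", remainder == 0)
print("codons minus stop match protein length:", coding_codons == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 1335 bp corresponds to 445 codons, one of which is the stop codon, leaving 444 amino acids.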


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.228286
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000903
Fosmid Hitchhiker: No
Fosmid clonability: decreased coverage
 

Sequence

Gene sequence
GTGACCGACG TGCTCCCAGC CGGACCCGGC CGCTACCCGG CCGCCGCGCC GGCCTCCGAG 
GCCCTGTTCG CCCGCGCCCG CGCCCTCGTG CCCGGCGGGG TGAACTCCCC TGTCCGCGCG
TTCCGTGCCG TCGGCGGCAC CCCGCGCTTC ATGGTCCGAG GGGAGGGTCC ATGGCTGTAC
GACGCCGACG GACGGCGCTA CGTCGACCTG GTCTGCTCGT GGGGCCCCAT GATCCTGGGG
CACGCGCACC CCGCGGTGGT GGAGGCGCTG CACTCGGCCG CCGCGCTCGG CACCAGCTTC
GGCGCCCCCA CCCCGGGTGA GGTGGAGTTG GCCGCGGAGA TCGTCGACCG CACGCCCGTC
GAGCAGGTAC GTCTGGTCAG CTCGGGCACC GAGGCCACCA TGTCGGCGAT CCGGCTGGCC
CGGGGCTGCA CCGGCCGCGC CCGGATCATC AAGTTCGCCG GCTGCTACCA CGGGCACTCG
GACGCACTGC TCGCCGCCGC CGGCTCCGGC GTCGCCACCT TCGGCCTGCC CGACTCGCCG
GGTGTGACCG ACGCGGCAGC CGGGGACACG ATCGTGCTGC CGTACAACGA CATTCAGGCA
GTCGAGGCGG CGTTCGCCGC CGAGGGCCCA CAGATCGCCG CGATCATCAC CGAGGCCGCC
GCCGGCAACA TGGGTGTGGT GGCTCCTCGC GACGACTTCA ACCAGCGACT CGCCGCCATC
GCCCACGCCA ACGGTGCACT GCTGATCGTT GATGAGGTCA TGACCGGCTT CCGGGTCTCC
CGAGCCGGGT GGCACGGCCT GGACGCCTGC CCGGCCGACC TGTGGACCTA TGGCAAGGTC
ATGGGTGGTG GCCTGCCCGC CGCCGCCTTC GGTGGCCGAG CGGAGATCAT GGCACAACTG
GCCCCCGCCG GTCCCGTCTA CCAGGCCGGC ACCCTCTCCG GTAACCCCCT CGCCTGCGCC
GCCGGGCTCA CCACGCTGCG GCTCGCCGAC GACGCCCTCT ACCGCAGGCT GGACGACACG
GCCGCCGTCG TGGGCCGGCT CGCCGGTGAC GCCCTCGCCG CCGCCGGGGT GCCGCACCGG
TTGTCGTACG CGGGCAACAT GTTCTCGATC TTCTTCACCG ACGCCGACGT GGTCGACTAC
GCGAGCGCGC GTACCCAGCA GGTGCCCGCG TTCAAGGCGT TCTTCCACGC CATGCTCGAG
GCCGGCGTCT ACCTGCCGCC GAGCGCCTTC GAGTCGTGGT TCGTCTCGGC GGCGATCGAC
GACACCGCCC TGGAGCAGAT CGCCGCGGCG CTGCCAGCGG CGGCAGCGGC AGCCGCGGCG
GGTCACGGGG GGTGA
 
Protein sequence
MTDVLPAGPG RYPAAAPASE ALFARARALV PGGVNSPVRA FRAVGGTPRF MVRGEGPWLY 
DADGRRYVDL VCSWGPMILG HAHPAVVEAL HSAAALGTSF GAPTPGEVEL AAEIVDRTPV
EQVRLVSSGT EATMSAIRLA RGCTGRARII KFAGCYHGHS DALLAAAGSG VATFGLPDSP
GVTDAAAGDT IVLPYNDIQA VEAAFAAEGP QIAAIITEAA AGNMGVVAPR DDFNQRLAAI
AHANGALLIV DEVMTGFRVS RAGWHGLDAC PADLWTYGKV MGGGLPAAAF GGRAEIMAQL
APAGPVYQAG TLSGNPLACA AGLTTLRLAD DALYRRLDDT AAVVGRLAGD ALAAAGVPHR
LSYAGNMFSI FFTDADVVDY ASARTQQVPA FKAFFHAMLE AGVYLPPSAF ESWFVSAAID
DTALEQIAAA LPAAAAAAAA GHGG