Gene Franean1_6424 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6424 
Symbol 
ID5674739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7803706 
End bp7805097 
Gene Length1392 bp 
Protein Length463 aa 
Translation table11 
GC content73% 
IMG OID641245272 
Productcystathionine beta-synthase 
Protein accessionYP_001510667 
Protein GI158318159 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0031] Cysteine synthase 
TIGRFAM ID[TIGR01137] cystathionine beta-synthase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.510477 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.13322 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTGT ACGACCACGT CACCGACCTG GTCGGTGACA CGCCACTGGT GCGGCTGACG 
CCGGTGATCG CCGACACGGT GACACCCGTG CTCGGCAAGC TCGAGTACCT CAACCCGGGC
GGCTCGGTGA AGGACCGCAT CGCCCTGTCG ATGGTGGCGG CGGCTGAGCG CGACGGCCGG
CTCACCCCCG GCGGGACCAT CGTCGAGCCC ACCAGCGGGA ACACCGGTGT GGGCCTGGCC
ATGGTCGCCG CGCGACGCGG CTATCGCTGT GTCTTCACCA TGCCGGACAA GATCAGCGAG
GAGAAGCGGG CCGTCCTGCG GGCCTACGGG TCCGAGGTGA TCGTCTGCCC GACGGCCGTC
GCGCCGGACG ACCCCCGCTC CTACTACTCG GTCGCCCGCC GGGTGCTGAG CGAGACCCCC
GGCGCCTGGA GCCCCGACCA GTACTCCAAC CCTGACAACC CGGCCGCGCA CGAGGCCTCC
ACCGGCCCGG AGATCTGGCG GGCCACCGAC GGCCGGGTGA CGCACTTCGT CGCCGGCATC
GGCACCGGCG GGACGATCAG CGGGACGGGC CGCCATCTCA AGGCGGTCAG CGGCGGCACC
GTGCAGGTGA TCGGCGCCGA TCCCGAGGGC TCGGTCTACT CCGGCGGCAG CGGCCGGCCC
TACCTGGTCG AAGGCGTCGG CGAGGACATC TGGCCGACGA CCTACGACAA GTCCGTGGTC
GACCGGGTGG AGGCCGTCAG CGACCGCGAC TCGTTCCTGA TGACGCGCGA GCTGGCCCGC
CGGGCCGGGA TCCTCGTCGG TGGCTCCTGC GGCCTGGCGG TCGTCGCCGC GCTGCGGGTC
GCCCGCGAGC TCGACGCGGC CGGGACGACC GACGCCTGCG TCGTGGTGCT GCTCCCCGAC
TCCGGCCGCG GCTACCTGTC GAAGATCTTC AACGACGAGT GGATGTACGA CAACGGGTTC
CTCGACCCGC CGTCCGACGA GCCGACGGTC GCCTCGGTGC TCGCGCACAA GGCCGCGCAG
ACGTCCGGGC CGCCGAACCT CGTCCACGTG CATCCCGACG AGACCGTCGG GGCGGCCATC
TCGTACCTGC GGGAGTATGG GGTCTCGCAG ATGCCGGTGG TGCGCCACGA GCCGCCGGTG
CGGGCCGCGG AGGTGGCGGG CGCCGTGCTG GAGCGCGAGC TGCTCGACGC AGTCTTCGCC
GACCGGGGGA CGGTGGACGC GCCCGTCGCC GACCACATGT CGCCACCGCT GCCGACGGTC
GGCGCCGGCG AGCCCGTCTC GGTGCTGGTG AGCGCGCTCG GCGAGAACCC GGCCGCCCTC
GTCCTGGACG AGGGGAACCC GACGGGGATC CTCACCCGGG CCGACCTGCT GGGGTTCCTC
GCCGTCCGCT GA
 
Protein sequence
MDVYDHVTDL VGDTPLVRLT PVIADTVTPV LGKLEYLNPG GSVKDRIALS MVAAAERDGR 
LTPGGTIVEP TSGNTGVGLA MVAARRGYRC VFTMPDKISE EKRAVLRAYG SEVIVCPTAV
APDDPRSYYS VARRVLSETP GAWSPDQYSN PDNPAAHEAS TGPEIWRATD GRVTHFVAGI
GTGGTISGTG RHLKAVSGGT VQVIGADPEG SVYSGGSGRP YLVEGVGEDI WPTTYDKSVV
DRVEAVSDRD SFLMTRELAR RAGILVGGSC GLAVVAALRV ARELDAAGTT DACVVVLLPD
SGRGYLSKIF NDEWMYDNGF LDPPSDEPTV ASVLAHKAAQ TSGPPNLVHV HPDETVGAAI
SYLREYGVSQ MPVVRHEPPV RAAEVAGAVL ERELLDAVFA DRGTVDAPVA DHMSPPLPTV
GAGEPVSVLV SALGENPAAL VLDEGNPTGI LTRADLLGFL AVR