Gene Francci3_1998 details

Gene Information

Locus tag: Francci3_1998
Symbol:
ID: 3903706
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2348170
End bp: 2349399
Gene Length: 1230 bp
Protein Length: 409 aa
Translation table: 11
GC content: 70%
IMG OID: 637879334
Product: O-succinylhomoserine sulfhydrylase
Protein accession: YP_481101
Protein GI: 86740701
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0626] Cystathionine beta-lyases/cystathionine gamma-synthases
TIGRFAM ID: [TIGR01325] O-succinylhomoserine sulfhydrylase
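
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only and not part of the database record. It assumes 1-based inclusive coordinates and a gene length that counts the terminal stop codon; the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length = gene_length(2348170, 2349399)
print(length)                  # 1230
print(protein_length(length))  # 409
```

Both values match the Gene Length and Protein Length fields listed for this gene.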


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.469724
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGCCTC CGTCCTCCAA CCGTACGGCC TCCGGCACCC GCCGCGGACT GGCGACGGAG 
GCGGTGCGTG CCGGCCATCG CCAGTCCGTT GACGACCAGC ACAGCGAGGC GCTGGTACTC
ACGTCGAGCT ACCTGTTCGA CGACTCGCAC GACGCGGCGG AGAAGTTCGC GCAGCGGCGC
CCCGGCAACG TCTACGTCCG GTTCACGAAC CCGACGGTGC GCGCGTTCGA GGAGCGCGTC
GCGCGGCTGG AGGGTGCCGA GTCCGCCGTA GCGACCGCGT CCGGGATGGC CGCATTCTTG
GCGGTGTCGC TCGGGCTGCT GCGCGGCGGG GACCATGTCC TGCTTGCGGA GGGTGTCTTC
GGCACCACGA CCCGGCTCTA TGCTCACTAT CTGGACCGGT TCGGCGTCAT GACGACCGTC
GTTCCGGTGA CCGACCCGGC CGCCTGGGCG CGCGCAATGC GTGCGCAGAC CAGGATGCTC
GTCGTTGAGA GCCCCACGAA CCCGGTAATG GCCGTGGCCG ACATCAGGTA CCTCGCAGAG
CTCGCACATG CCGCCGGCGC GCTGCTGCTG GTCGACAACA CCCTGTGCAC CCCGGTGTTC
CAGCAACCGA TCGTGTTTGG CGCAGACCTC GTTCTGCACT CCGCCGGCAA GTACATCGAC
GGTCAGGGCC GCTGCGGCGG CGGCGTTGTC GCCGGCCGCG CGGGCCTGAT CTCCGAGCTG
CACGGTGTGC TGCGCACCGC GGGCCCGAGC CTCAGCCCGT TCAACGCGTG GATCTTCCTG
AAGAGCCTGG AGACGCTGCC GGTGCGGATG CGGGCGCACG ACGCCAACAC GGCGGTGGTG
GCTGCCTGGC TAGCCGACCA ACCGGACGTA CGGGCGGTGC ACTACACCGG CAGCGCGGAT
CACCCGCAGC GGGAGCTGGT AGCCGCCCAA CAGTCCGGGC ACGGCGGAGT GATCAGCTTC
GAGCTGTACG GCGGCCAGCA GGCCAGCTGG TCGTTTGTCG ACCGGCTTGA GCTCGTGTCG
AACACGACAA ACATCGGGGA CACCAAGTCG ATGATCACCC ATCCGGCAAG CACCACCCAC
GGCCGGCTCA CGCCGGCACA GCGCGACTCC GCCGGCGTCA CCGACGGCCT GCTGCGCCTA
TCGGTCGGCC TGGAGGACGT CGAGGACATC GTCGCCGATC TGGCCCGGGC GTTCGCAGCG
ACCAGGCCTG CCGGGGCCCG CGCCCGATGA
 
Protein sequence
MTPPSSNRTA SGTRRGLATE AVRAGHRQSV DDQHSEALVL TSSYLFDDSH DAAEKFAQRR 
PGNVYVRFTN PTVRAFEERV ARLEGAESAV ATASGMAAFL AVSLGLLRGG DHVLLAEGVF
GTTTRLYAHY LDRFGVMTTV VPVTDPAAWA RAMRAQTRML VVESPTNPVM AVADIRYLAE
LAHAAGALLL VDNTLCTPVF QQPIVFGADL VLHSAGKYID GQGRCGGGVV AGRAGLISEL
HGVLRTAGPS LSPFNAWIFL KSLETLPVRM RAHDANTAVV AAWLADQPDV RAVHYTGSAD
HPQRELVAAQ QSGHGGVISF ELYGGQQASW SFVDRLELVS NTTNIGDTKS MITHPASTTH
GRLTPAQRDS AGVTDGLLRL SVGLEDVEDI VADLARAFAA TRPAGARAR
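
The gene and protein sequences above are related by translation, and the GC content field can be recomputed from the gene sequence. The following is a minimal sketch and not the database's own tooling. The helper names are hypothetical. It uses the standard codon assignments, which translation table 11 shares with the standard code (table 11 differs in its permitted start codons, which does not matter here because the gene starts with ATG). Whitespace in the 10-base display groups is stripped before processing.

```python
from itertools import product

# Standard codon assignments, with bases ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_block: str) -> str:
    """Remove spaces and line breaks from a displayed sequence block."""
    return "".join(seq_block.split()).upper()


def gc_content(seq_block: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq_block)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_block: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq_block)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence shown above.
print(translate("ATGACGCCTC CGTCCTCCAA CCGTACGGCC"))  # MTPPSSNRTA
```

Passing the full gene sequence block to `translate` and `gc_content` would allow the listed protein sequence and GC content to be checked against the nucleotide data.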