Gene Francci3_4397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4397 
Symbol 
ID3907372 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5255296 
End bp5256573 
Gene Length1278 bp 
Protein Length425 aa 
Translation table11 
GC content71% 
IMG OID637881728 
Productthreonine synthase 
Protein accessionYP_483472 
Protein GI86743072 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0498] Threonine synthase 
TIGRFAM ID[TIGR00260] threonine synthase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACACTCG CGCCCGAGCG TGCCACCAGT CCCACCCTCG AGGTCGCGAA CCCCGCGGTC 
TCGCTGTCCT GTCGGCACTG CGGCGCCACC TACCCGCTGT CCCCTACGCA TGTGTGCGTG
GAATGCTTCG CTCCTCTGGA AATCGCCTAC GACCCGGACC TGCTGGCTCG TGTGACCCGA
GAGTCCATCG AACGTGGTCC GGCGAACCTG TGGCGTTACG TCGGGCTGCT GCCCGCCGGG
CATGACCCGG CGCGGCGGGT CTCCCTCGGG GCCGGCTGGA CGCCACTGCG CCCGGCCCCG
AGGCTCGCCG CGGAGCTGGG GATGAAGACG CTCTATATCA AGGACGACAG CGCCAACCCC
ACCCACTCCT TCAAGGACCG GGTCGTCTCC GTGGCGCTCA CCGCCGCCCG TGAGCTGGGC
TTCACCACGG TCGCCTGCGC CTCGACGGGC AACCTCGCGC AGTCGGTGGC GGCGCACGCG
GCCGGTGCCG GGCTGCGCTC GGTGGTGCTC GTACCGCACG ACCTCGAGGC TGGCAAGACG
GTCTCCACCG GGGTGTACGG CGGGACGCTG GTCGCCATCC AGGGCAACTA CGACGACGTG
AACCGGTTGT GCAGCGAGCT GGCCGGAGAG TACGAATGGG CGTTCGTCAA CGTCAACGTC
CGCCCCTTCT ACGCCGAGGG ATCCAAGACG CTCGGCTACG AGGTCGCCGA GCAGCTCGGC
TGGCGGCTGC CCGAGCAGGT CGTCGTGCCG ATCGCCTCCG GTTCACTGCT AACAAAGATC
GACAAGGCCT TCGGTGAGCT CGGCCGGCTC GGCCTGGTCG AGCCGACGCC GTACAGGGTG
TTCGGTGCGC AGGCGGCCGG CTGCAACCCG GTGGCCGCCG CGTTCGCCCG CGGCGTCGAG
ACGGTCGCCC CGGTCCGGCC CGCCACAATC GCCAAGTCGC TGTCGATCGG CAACCCGGCC
GACGGCCCCT ACGCCCTCGA CGTGGCTCGG CGGACCGGCG GGGCTATCAC GGACGTGACC
GACGAGGAGA TCATCGACGG GATCCGGCTG CTCGCCCGCG CCGAGGGCGT CTTCGGCGAG
ACGGCCGGTG GCGTCACCGT GGCCACCCTG CGCAGGCTGC TGCGCGAGGG GCTGCTCGAC
CCGGCCGCCG AAACCGTCCT TTACAACACC GGCGATGGGC TGAAGACGCT CGACCCCCTC
GTCGAGACGG GCGGACCGAC CGCGACGATC AAGCCGTCGC TGCGCGCCTT CGAGGCCGCC
GGCCTCGGCG ACGACTGA
 
Protein sequence
MTLAPERATS PTLEVANPAV SLSCRHCGAT YPLSPTHVCV ECFAPLEIAY DPDLLARVTR 
ESIERGPANL WRYVGLLPAG HDPARRVSLG AGWTPLRPAP RLAAELGMKT LYIKDDSANP
THSFKDRVVS VALTAARELG FTTVACASTG NLAQSVAAHA AGAGLRSVVL VPHDLEAGKT
VSTGVYGGTL VAIQGNYDDV NRLCSELAGE YEWAFVNVNV RPFYAEGSKT LGYEVAEQLG
WRLPEQVVVP IASGSLLTKI DKAFGELGRL GLVEPTPYRV FGAQAAGCNP VAAAFARGVE
TVAPVRPATI AKSLSIGNPA DGPYALDVAR RTGGAITDVT DEEIIDGIRL LARAEGVFGE
TAGGVTVATL RRLLREGLLD PAAETVLYNT GDGLKTLDPL VETGGPTATI KPSLRAFEAA
GLGDD