Gene Franean1_0040 details

Gene Information

Locus tag: Franean1_0040
Symbol:
ID: 5668466
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 47146
End bp: 48660
Gene Length: 1515 bp
Protein Length: 504 aa
Translation table: 11
GC content: 74%
IMG OID: 641238969
Product: hypothetical protein
Protein accession: YP_001504414
Protein GI: 158311906
COG category:
COG ID:
TIGRFAM ID:
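
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and not part of the database record. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function name `check_record` is hypothetical.

```python
def check_record(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Cross-check coordinates against the stated gene and protein lengths."""
    # Assumption: coordinates are 1-based and inclusive.
    span_bp = end_bp - start_bp + 1
    # A complete CDS should span a whole number of codons.
    whole_codons = span_bp % 3 == 0
    # Assumption: the last codon is a stop codon and yields no residue.
    expected_protein_aa = span_bp // 3 - 1
    return {
        "span_matches_gene_length": span_bp == gene_length_bp,
        "whole_number_of_codons": whole_codons,
        "protein_length_matches": expected_protein_aa == protein_length_aa,
    }


# Values copied from the record for Franean1_0040.
result = check_record(47146, 48660, 1515, 504)
print(result)
```

With the values listed here, the span is 48660 - 47146 + 1 = 1515 bp, which is 505 codons, or 504 residues once the stop codon is excluded.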


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal


Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal


Sequence

Gene sequence
GTGACGGCCC GGACCGGCCG CTACACCGCT GAGCAGTACG AACGCTGGCA CATCCGCGTC 
TGCGCGCGCT GCGGGCGGCG GGCGTCCCTA TCGGCGAACT GGTCGGACGG GCCGATCTGC
CGCAGCTGCT ACGACCGGGC CGCCCGCACC TACGGTCGCT GCCCCGGTTG CCAGGCCGAA
CGGCTCCTTC CCGGCCGCGA CGACGACGGG GCCGCCCTCT GCAGGGACTG CGCCGGCATC
ACCCGCGACT TCTTCTGCTC CCGGTGCGGT TTCGAGGGCC TGCTGCTCGG CGGCCGGCTC
TGCGAACGCT GCACCCTCAC CGACCAGCTC GCTGCCGCCC TCGACGACGG CACCGGGAAC
GTCAGCCCGC CGCTGGTGCC GCTGCTCGAC GCGCTGCGCG TGATGCCGAA GCCGAAGTCC
GGCCTGGCCT GGCTGCGCAA CCCCCGAGTC CGTGAACTGC TCGCGGACCT GGCCACGGGG
CGCGTCGCGC TGACGCACGA GGCCCTGCAC GCGCTGCCGA ACTGGCGGAC CGTCGCCTAC
CTGCGGGACC TGCTGATGGC CTGCGGCGTG CTACCCGCCG TCGACAAGCA GCTGCTGCAC
CACGAGACCT GGCTGCACCG CCAGCTGGCC GAGCTCGACG GCCACCCCCA CGCCCGGCTG
CTGCGCCAGT TCGGCACGTG GGCGCAGCTG CCGCGGCTTC GCCACCGGGC GGCGGCGCGC
CCGCTGACCC CGCACGCCCG CAAGGAGGCG GCCGCGCAGT TTACCCAGGC CCGGCTGTTC
CTGGCCTGGC TCGACGAGCG CGACCGGACA CCGGAAACGC TCACCCGGAC CGACGTCGAT
GTCTGGCACG CCACCCACCT CGACCACGCG AAACGCTCCC TGCGGACGTT CCTGACCTGG
GCGATGGACA GCGGCCATCT GCCCTGCCTC GACCTTCCCC GCCTCCAGAT CGTCCGCGCG
GAGCCTCTCA CCCAGCGGCG CCGCCTCGAC CTGGTGAAAT CCGTCCTGAC CAGCGAGACC
GGCTCGCCGC CGACCCGCGC CGCGGCCTGC CTGATGCTGC TCTACGCCCA GCCCGCCAGC
CGCATCGTGC GCCTCACCGT CGACGACCTC ACCCGCGACG GCGACCAGGT CCTGCTCCGG
CTCGGCGACC CGCCCGTCCC GGTTCCCGAC CCGTTCGCCA CGCTCCTGCT GACCGCCGCA
ACCCGGCGGG ACAACATGAC CACCGCCACG AACCCGGACA GCCGCTGGCT GTTCCCCGGC
CGCCGCGCCG GCCAGCCCCT GCACCCCTGC AGCCTGCTCG ACCAGATCCG CGCCCTCGGC
ATCCCGATCC AGGCCGCCCG CACCGCCGCG CTACGCCAGC TCGTCCTGCA AGCCCCCGCC
CCGGTCGTCG CCCAGGCCCT CGGCTACCAC CCGATCACCA CCCAGTGGCA CCGCGCCGAC
GCCGGCGGCA CCTGGACCCA CTACGCCCCC GGCGATCACG CCGGGCCGAG CCCCACGCCG
CCGGTCATCA CGTGA
 
Protein sequence
MTARTGRYTA EQYERWHIRV CARCGRRASL SANWSDGPIC RSCYDRAART YGRCPGCQAE 
RLLPGRDDDG AALCRDCAGI TRDFFCSRCG FEGLLLGGRL CERCTLTDQL AAALDDGTGN
VSPPLVPLLD ALRVMPKPKS GLAWLRNPRV RELLADLATG RVALTHEALH ALPNWRTVAY
LRDLLMACGV LPAVDKQLLH HETWLHRQLA ELDGHPHARL LRQFGTWAQL PRLRHRAAAR
PLTPHARKEA AAQFTQARLF LAWLDERDRT PETLTRTDVD VWHATHLDHA KRSLRTFLTW
AMDSGHLPCL DLPRLQIVRA EPLTQRRRLD LVKSVLTSET GSPPTRAAAC LMLLYAQPAS
RIVRLTVDDL TRDGDQVLLR LGDPPVPVPD PFATLLLTAA TRRDNMTTAT NPDSRWLFPG
RRAGQPLHPC SLLDQIRALG IPIQAARTAA LRQLVLQAPA PVVAQALGYH PITTQWHRAD
AGGTWTHYAP GDHAGPSPTP PVIT