Gene Franean1_2311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2311 
Symbol 
ID5670709 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2760149 
End bp2761192 
Gene Length1044 bp 
Protein Length347 aa 
Translation table11 
GC content74% 
IMG OID641241230 
Producthypothetical protein 
Protein accessionYP_001506651 
Protein GI158314143 
COG category[S] Function unknown 
COG ID[COG2013] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.1559 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGCACG GCGGGATGTA CGGCGGCCAG CCCGGCGGGA TGCCCCCTGG CGGGTTTCCG 
CCCGGACCCG GTCCGGGCAT GCCCGGAGGG ATGCCTCCGG GTGGGATGCC CCCCGGGGGG
ATGCCGCCAG GAGGCCCCGG TGGTCCTGGT GGTCCCGGTG GGCCGGCGGG AGGGATCCGC
AGCAGCCTGC TCGGCAACCC CGAGGTCACC AGCGGCGAGC GGTTCGCCCT GCAGAACGGC
AAGATGCTCA AGGCGACCCT CGGCGCGCAG GGGATGCGTG AGTTCTACGC CCGGCGCGGC
GCCATGGTCG CCTACCAGGG CGCCGCGACC TTCGACGCCC ACTGGGAGGG CTGGGGCTCC
CGGTTCCGCA GCTTCTTCAG CGGCGGCGAG GGCCTGAACC TGATGAACGT CGCCGGCTCC
GGCGCGGTCT ACCTCGCCAA CCAGGCCCAG GACATCCACA TCCTCGACCT GGCGGGTGAC
GGCCTGACCG TGGACGGCAA GAACGTGCTC GCGTTCGACG CCGGCCTGAA CTGGGATCTC
GTCCGTGTCG ACAGCCAGGT GGGCATCGCC GGCGTCGGCG GTTACCAGAT CGAGCTGCGC
GGCAACGGCC AGGCCGTCGT GTGCACCTCG GGCGCCCCGC TGGTGATGCG GGTGACCACC
CAGAACTACT ACTTCGCGGA CGCCGACGCC GTCGTCGGCT GGTCGTCGAG CCTGCAGGTC
TCGATGCAGG CCGCGGTCAC CTCCAGCGCG GCCTGGAAGC CGCGGGGCAA CACCGGGGAG
AGCTGGCAGC TCCAGTTCTC CGGCGAGGGC TATGTGATCG TCCAGCCCTG CGAGCTGCTG
CCGCCCTACA ACGCGCTCGC CGGCTCCGGC CTGGCCGGGC AGTTCGGGCT TGGCCAGGGC
GGGTTCGCCG GCAACCAGCT CGGCGGGCAC GGCGGCGGCC ACGGCGGCCA CGGCGGGCAC
GGCGGGGGCC CGGGCGGGTT CGGCGGCGGC CCCGGTGGGC TCGGCGGCTT CGGCGGACTG
TTCGGAAACC AGGGCCGCCA CTGA
 
Protein sequence
MTHGGMYGGQ PGGMPPGGFP PGPGPGMPGG MPPGGMPPGG MPPGGPGGPG GPGGPAGGIR 
SSLLGNPEVT SGERFALQNG KMLKATLGAQ GMREFYARRG AMVAYQGAAT FDAHWEGWGS
RFRSFFSGGE GLNLMNVAGS GAVYLANQAQ DIHILDLAGD GLTVDGKNVL AFDAGLNWDL
VRVDSQVGIA GVGGYQIELR GNGQAVVCTS GAPLVMRVTT QNYYFADADA VVGWSSSLQV
SMQAAVTSSA AWKPRGNTGE SWQLQFSGEG YVIVQPCELL PPYNALAGSG LAGQFGLGQG
GFAGNQLGGH GGGHGGHGGH GGGPGGFGGG PGGLGGFGGL FGNQGRH