Gene Franean1_3816 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3816 
Symbol 
ID5672180 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4533893 
End bp4535461 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content71% 
IMG OID641242695 
Producttail sheath protein 
Protein accessionYP_001508115 
Protein GI158315607 
COG category[R] General function prediction only 
COG ID[COG3497] Phage tail sheath protein FI 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.414451 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.268079 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAACT ACCGGTCGCC CGGCGTCTAC GTCGAGGAGG TCGAGGCGGG GTCCCGCCCG 
ATCGAGGGCG TCGGCACCGC GGTCGCCGCG TTCGTCGGCT TCACCGCGCG CGGTCCTCGC
AACGACGCGG TAAGGATCGC CAACTGGGGC CAGTACGTGC AGGTCTTCGG CGACTTCGTA
CCCGGCGCCT ACCTGCCACT GGCGGTCTTC CAGTACTTCA ACAACGGCGG CGGGGCCTGC
TACGTGGTCA GCGTGGGCTC CGGCGGCGAC GGGGCGACCG CGGGCCCGGT GGCCGAGCTC
ACCAGCGCCA CCCGTCCCGG CGTGGGTGTC TACCAGGTGG AGGCGATCGA CCCCGGGGCG
CGCGGCGGCC TGGCGGTCGA GATCGCGCCG ATCGATCGGG CCGGCGGCGA GGCGGCGAAC
GGGGGGGACG GTGACGGCGC GGCGGCAGAC GTCGGCCGCG CCTTCACCGT GATCGTCACG
CAGGACGGCC GGCAGGTGGA GACGTTCGAG AACCTCACCG CCCGCCGCGG GAAGCGCAAT
GTCGCCACCG TCATCAACGA GCAGTCGCAG CTGGTCCGGG TGATCGAGTT GGACGGGGTG
ACCGCGCTCG ACCGTGCCCC CTCGCCCGGC ACGGTCGCGC TGCGCGCGCC GGCGCCGGCG
CCGGCGCCGC GGGCGAGCGC CGAGGACTTC ATCGGCGACG TCTCCGCCCG TACCGGGGTG
TGGGGGCTCG CTGCCTGCGA CGACGTCACC ATGGTCCTGG CTCCCGACCT GATGTTCTGC
TACCAGCAGG GACAGATCGG CCTGGACACG GTGCAGGCCG TGCAGCAGGC GATGATCGAC
CACTGCGAGA ACCTGGGCGA CCGGATGGCG ATCCTCGACC CGCCGCCTGG GCTGAACGCA
CAGCAGATCA AGGACTGGGT GAAGGAGAAG GCCCGGTACG ACTCCAAGTA CGCCACCCTC
TACTGGCCGT GGGTGTCGGT TTTCGACCCG GTGGAGGGCG GGCGCCGGTT CATCCCACCG
TCCGGGCCGG TCGCCGGGAT CTGGGGCCGC AGCGACGACA GCCGCGGGGT GCACAAGGCA
CCCGCCAACG AGGTGGTCCG CGGCGCCCTC GATCTGGAAA TCAACATCAC GAGGAGCGAG
CACGACGCGT TGAACCCCGA GGGGATCAAC GTGATCCGCG CTTTCCCCGG CCGCGGGATC
CGGGTCTGGG GCGCACGCAC CCTCTCGTCG GACCCGGCCT GGCGCTACGT CAACGTCCGG
CGGCTGTTCA ACTATCTGGA AGCGTCGATC CTCAATGGCA CCCAGTGGGC GGTGTTCGAG
CCGAACGACC TCGACCTCTG GCAGCGGCTG CGGCGCACGG TGTCCGCGTT CCTGCTGGGC
ATGTGGCGGG ACGGCGCCCT CTTCGGTATC ACCCCGGACC AGGCCTTCTA CGTCAAGTGC
GACGAGGAGA CGAACCCACC CGGTGTCGTC GACGCCGGCC AGGTGATCAT CGAGATCGGG
GTCGCGCCGG TCAAGCCGGC CGAGTTCGTG GTCTTCCGGA TCGCTCAGAT CACCGCCGGC
GCCGAGTAG
 
Protein sequence
MPNYRSPGVY VEEVEAGSRP IEGVGTAVAA FVGFTARGPR NDAVRIANWG QYVQVFGDFV 
PGAYLPLAVF QYFNNGGGAC YVVSVGSGGD GATAGPVAEL TSATRPGVGV YQVEAIDPGA
RGGLAVEIAP IDRAGGEAAN GGDGDGAAAD VGRAFTVIVT QDGRQVETFE NLTARRGKRN
VATVINEQSQ LVRVIELDGV TALDRAPSPG TVALRAPAPA PAPRASAEDF IGDVSARTGV
WGLAACDDVT MVLAPDLMFC YQQGQIGLDT VQAVQQAMID HCENLGDRMA ILDPPPGLNA
QQIKDWVKEK ARYDSKYATL YWPWVSVFDP VEGGRRFIPP SGPVAGIWGR SDDSRGVHKA
PANEVVRGAL DLEINITRSE HDALNPEGIN VIRAFPGRGI RVWGARTLSS DPAWRYVNVR
RLFNYLEASI LNGTQWAVFE PNDLDLWQRL RRTVSAFLLG MWRDGALFGI TPDQAFYVKC
DEETNPPGVV DAGQVIIEIG VAPVKPAEFV VFRIAQITAG AE