Gene Franean1_1061 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1061 
Symbol 
ID5669475 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1250929 
End bp1252746 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content72% 
IMG OID641239990 
ProductFHA domain-containing protein 
Protein accessionYP_001505423 
Protein GI158312915 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00105103 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCGA CACCCGGCGG CCGACCGACC CCTCAGGTCG ACCGCACCTC CCGGTTCGGC 
TTCGGCCGGA ATCCCGCGGG CGCCCCGGCG TCCCGGCCGC CGCGGCCGCG CCGCCGGCAC
CGCTTCGGCC GTGGGCCGCG GCTTGGCCTG GCAGCCGGCT TCGTGCTGGC GCTGTCCCTG
CCGCTGGTGG CGCCGGCCGG TGCCGCGCCG ACGGCCGCCA CGCCGACACC GTCGCCCACC
GTGGGCACGA CCCCGGAGGC TGACGGAACC TCGTCGCCCG TGCCGTCCTC GTCGGCCGGG
GGTGTCGCCG ACCAGCAGGT CCCGGTCGCC GGCCTGACCT CCGGCCCGGC CACCGCCGGG
CAGTCGAAGC TGACTCTCGT CCTGTTCGAC AGCGACATCG TGCTCGTCGG CGGGCGTGGC
TTCGGGCCCG GCAAGGACGT GGCGGTCACC GCCGCGACCA CCGACCTCGG TGGCTCCGCG
AGCGCCCGGG CCGGCGCGGA CGGCCGGTTC ATCCTCGGCT TCCAGGTGCC GCTCGGTTTC
TCCGGCCGGG TGACCGTGAC GGCGAAGCAG GAGTCGGTGG AGGCCAGCGG CACGCTCGAC
GTCGTCGACG CTGGCACGCC GGGCCTGGCG AACGGGGCGG CCAACGGCGC CGAGGTCCCC
GCACCAGCCC CGACGGCGAG TCCTGTCCCG GGCCCCGCGG CCACCCCGGC GCCGACGACC
GCCGAGCCCA CGACTTCGGA GCCCACGACT TCGGAGCCCA CGGCCTCCGG GCCCACCAGC
TCGGAGCCGA CGGGCACCGC TCGGGCCACC ACCCCCCCGA GCACCGGCCG GCGGGGCTCG
TCGGCGCCGA CCGCCGTACC CGCGCCAGCG CCCACGGCCG CGCCGTCGGC GGGGAGCGGG
ACGGGCACCG GCACCACCAC CGGCGGCACC GGCCGGCTCT CCGGCCTGCC CTGGATGTCC
GGCGTCTACC CGTCGCACGT CCTGTCGCAG GTCATGTCCT TCGGGACGTG GCGCGGGCGG
GCGAACGACG TGGCGCACGT GTTCACCGTC CGTACCCAGG GCTGGAACGC GATGGTCGAG
CCGCGCTGGC CGCTGGACCT GTACAAGGCC TTCCCGGGCA AGCTGATTAT CAGCCAGCCG
ACCTATCCGA AGGGCCAGGG CAACAACGCG GCCTGCGCCC GCGGCGAGTA CGACAGCTAC
TGGAAGACCT TCGGCACGTT CCTCAAGAAC AACGGCCGCG CCGATTCGAT CGTCCGCATC
GGCTGGGAAT TCAACGGCAA GTTCATGTAC TGGCACTCGG ACCCGGCCGG GACGGAGTTC
CGCGACTGCT TCCGCAAGAT CTCGACTGCC ATCCGCTCGA CGGACCCCGC GGTGAAGATC
GACTGGACGT TCAACGCGCA CGCCTCGCCG GTTCCCAACG GGGGCACCCC GTGGGCGGCC
TACCCTGGTG ACGAGTACGT CGACTATGTC GGCATCGACT CCTACGACTG GTACCCGCCG
TCGCGGGACG AGGCCACCTG GAAGAAGCAG TGCGAGGACC CGAACGGCCT GTGCTACCTG
CTCGAGTTCG CCCGCCAGCA CGGCAAGAAG GTGGGCGTGG GCGAATGGGG CGTGTCCTCG
TGCAGCCGCA ACGGCGGCGG TGACAACCCC TTCTACATCC AGAAGATGTT CGACACGTTC
ACGAAGTACG CGGACGTGAT GGCGTACGAG TCGTACTTCC ACGACGCGGC GCCCGGCAAC
GTCTGCTCGA CCATCATGAA CGGCGGCCAG AACCCGAAGG CGTCCGCCCT GTACAAGAAA
CTGTTCGGCT CGGTCTGA
 
Protein sequence
MASTPGGRPT PQVDRTSRFG FGRNPAGAPA SRPPRPRRRH RFGRGPRLGL AAGFVLALSL 
PLVAPAGAAP TAATPTPSPT VGTTPEADGT SSPVPSSSAG GVADQQVPVA GLTSGPATAG
QSKLTLVLFD SDIVLVGGRG FGPGKDVAVT AATTDLGGSA SARAGADGRF ILGFQVPLGF
SGRVTVTAKQ ESVEASGTLD VVDAGTPGLA NGAANGAEVP APAPTASPVP GPAATPAPTT
AEPTTSEPTT SEPTASGPTS SEPTGTARAT TPPSTGRRGS SAPTAVPAPA PTAAPSAGSG
TGTGTTTGGT GRLSGLPWMS GVYPSHVLSQ VMSFGTWRGR ANDVAHVFTV RTQGWNAMVE
PRWPLDLYKA FPGKLIISQP TYPKGQGNNA ACARGEYDSY WKTFGTFLKN NGRADSIVRI
GWEFNGKFMY WHSDPAGTEF RDCFRKISTA IRSTDPAVKI DWTFNAHASP VPNGGTPWAA
YPGDEYVDYV GIDSYDWYPP SRDEATWKKQ CEDPNGLCYL LEFARQHGKK VGVGEWGVSS
CSRNGGGDNP FYIQKMFDTF TKYADVMAYE SYFHDAAPGN VCSTIMNGGQ NPKASALYKK
LFGSV