Gene Franean1_5522 details

Gene Information

Locus tag: Franean1_5522
Symbol:
ID: 5673852
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6689148
End bp: 6690857
Gene Length: 1710 bp
Protein Length: 569 aa
Translation table: 11
GC content: 72%
IMG OID: 641244378
Product: replication initiator protein
Protein accession: YP_001509782
Protein GI: 158317274
COG category:
COG ID:
TIGRFAM ID:
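
The length fields in this record are consistent with the coordinates: with 1-based inclusive positions, End bp minus Start bp plus one gives the gene length, and dividing by three and dropping the stop codon gives the protein length. A minimal sketch of that check follows, assuming inclusive coordinates and a single terminal stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_bp // 3 - 1


length = gene_length_bp(6689148, 6690857)
print(length)                     # 1710
print(protein_length_aa(length))  # 569
```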


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0195674
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGGCC TCACCTCGAC TGACCCTGAC GCTTCCGCCG GCCGTGACGA TCGTCCGGGT 
TCCCGGGCGG CGCGGATGCG GACGCCGCTC GCTCGTCAGG TCGTGGAAAC GGTCGCAGTG
GAGAACGGGG TGTGCGTCCG GCCGATGGCG ATGCGCCGGA CGAACCTCGA CACCGGCGAG
ACCGAGATCA TCCCCGTACC GTGCGGCGCC ACGCTGGCGA GCAAGTGTCC GACCTGCGCG
GAGAAGGCCC GGCGGCTGCG GATGGCGCAG TGCAAGGCAG GCTGGCATCT CGACGACGAA
CCGCTACCCG ACCCGGACCC GCCCACGGAT GAGGCGAAGA CCCTCGCGGG CTTCCGTGCC
GATCTCGAAA CCGTCCGGAT CGACGCTGAA CGCGACGGGG ACGCGGCCGG CGTCGCCGAG
ATCGACGAAC TCATCGGCCA GGTGGACGAG GAACTCAACG CGCTGGGTGT GCGGGGGAAG
GCGGCGCCGG AGGATCGGGA TCGGCCTCGC CGTGTCCGCT CGACCCGCCG GCGGCAGGAT
GCCCCCGACC TGCCCCGGCT CCCGGTGGAC AAGCGGACAG TCGGGCGGAC CTTCGAAGCG
GCGGACGGCA CCACCTGGCG GCCGTCGATG TTCCTGACCC TGACCTGCGA CTCGTACGGG
CGGGTCACGA GTGAGGGAAC CCCGGTCGAT CCGGGCTCGT ACGACTACCG GCGGGCGGCC
CGGGACGCGA TCCACTTCCC GAAGCTGATC GACCGGTTCT GGCAGAACCT GCGCAGGGCG
GTCGGCTGGG ATGTGCAGTA CTTCGCCACC CTCGAACCGC AACGGCGGCT CGCCCCGCAC
CTGCACGCGG CCATCCGCGG AACGGTGCCC CGGGTCCTGC TGCGGCAGGT GGCGGCGGCC
ACGTATCAGC AGGTCTGGTG GCCGTCGTGT GACCGGCCGG TCTATGACGA CACGTGCCTC
CCGGTCTGGG ACGACACTGC GGCCGGCTAC CTCGACCCGG ACACGTCCCG GCCGCTCCCG
ACCTGGGATG AGGCAGTGGA CGCGATCGGC GACGATGCCG AACCGGCCCA CGTCGTCCGC
TTCGGGCCCC AGCTCCGCGC GGACGGTGTC ACGGCGAACT CGGCGAACAC CGGCCGGATG
ATCGGCTACC TGACCAAGTA TCTGGTGAAG AGCCTCGACG CCTGCCACGC CGTCACTACC
GACGCCCAGC GGCGGCACGT CGATCGGCTC GCGGACGCGC TGCGCCACGA ACCGTGCTCG
CCCACCTGCG CGAACTGGCT GCGCTACGGC GTCCAACCAC GCCACCCGAA ACCGGGCCTC
GTCCCGGGCC GGTGCCGGGG CAAGGTCCAC CGGCGCGAGA CGCTCGGGTT CGGTGGCCGG
CGGGTGCTCG TCTCGCGGCG CTGGTCCGGC AAAACCCTGA CCGACCACAA GCACGACCGG
GTCGCGTTCA TCCGGGAGCA GCTCGAAGCC CTCGGCCAGG TGGCGACCGG CCCGGCAGCC
ACTGGCACCG ACCCGGCACG GACGGTGTGG ACGCTGCTCC GGCCCGGTGA CCCGGCGGCC
CCTCGCCGCG AACACCTGCT GTTGCAGGCA GTCGCGCAAC GGCATGCCTG GCGCGCACAA
CTCGACGCGG CCCGCGCTGC CACGGCCGGC ACCGTCACGA CCGGCGGATC TCCGGGAACC
GGCCCACCGG CGGTGGCTGA CGCTGCCTGA
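
The GC content field can be recomputed from the gene sequence as the share of G and C bases. The sketch below assumes the sequence is pasted as shown here, in space-separated blocks of ten bases. `gc_percent` is a hypothetical helper name, and only the first line of the sequence is used as sample input.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence only; paste the full sequence to compare
# the result against the GC content reported in the record.
first_line = "ATGACCGGCC TCACCTCGAC TGACCCTGAC GCTTCCGCCG GCCGTGACGA TCGTCCGGGT"
print(f"{gc_percent(first_line):.1f}%")
```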
 
Protein sequence
MTGLTSTDPD ASAGRDDRPG SRAARMRTPL ARQVVETVAV ENGVCVRPMA MRRTNLDTGE 
TEIIPVPCGA TLASKCPTCA EKARRLRMAQ CKAGWHLDDE PLPDPDPPTD EAKTLAGFRA
DLETVRIDAE RDGDAAGVAE IDELIGQVDE ELNALGVRGK AAPEDRDRPR RVRSTRRRQD
APDLPRLPVD KRTVGRTFEA ADGTTWRPSM FLTLTCDSYG RVTSEGTPVD PGSYDYRRAA
RDAIHFPKLI DRFWQNLRRA VGWDVQYFAT LEPQRRLAPH LHAAIRGTVP RVLLRQVAAA
TYQQVWWPSC DRPVYDDTCL PVWDDTAAGY LDPDTSRPLP TWDEAVDAIG DDAEPAHVVR
FGPQLRADGV TANSANTGRM IGYLTKYLVK SLDACHAVTT DAQRRHVDRL ADALRHEPCS
PTCANWLRYG VQPRHPKPGL VPGRCRGKVH RRETLGFGGR RVLVSRRWSG KTLTDHKHDR
VAFIREQLEA LGQVATGPAA TGTDPARTVW TLLRPGDPAA PRREHLLLQA VAQRHAWRAQ
LDAARAATAG TVTTGGSPGT GPPAVADAA
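
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below builds the codon table by hand rather than using a library. It relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard genetic code, and it ignores the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG. `translate` is an illustrative name.

```python
# Codon table in NCBI ordering (bases T, C, A, G); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for pos in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE[bases[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last 30 bases of the gene sequence in this record.
print(translate("ATGACCGGCC TCACCTCGAC TGACCCTGAC"))  # MTGLTSTDPD
print(translate("GGCCCACCGG CGGTGGCTGA CGCTGCCTGA"))  # GPPAVADAA
```

The two outputs match the first ten and last nine residues of the protein sequence shown in this record.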