Gene Franean1_3441 details

Gene Information

Locus tag: Franean1_3441
Symbol:
ID: 5671812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4071324
End bp: 4073876
Gene Length: 2553 bp
Protein Length: 850 aa
Translation table: 11
GC content: 70%
IMG OID: 641242329
Product: TPR repeat-containing protein
Protein accession: YP_001507749
Protein GI: 158315241
COG category: [R] General function prediction only
COG ID: [COG0457] FOG: TPR repeat
TIGRFAM ID:
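
The coordinates and lengths in this record are mutually consistent, which a short sketch can check. It assumes 1-based inclusive coordinates and a terminal stop codon that is not counted in the protein length; the function names are illustrative and do not come from the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(4071324, 4073876)
print(length_bp)                  # 2553
print(protein_length(length_bp))  # 850
```

Both values match the Gene Length and Protein Length fields above.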


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGAGG AAGCCGGCTG GGACTTCTTC GTGTCCTATG CGGCGGACGA TGTCGAGTGG 
GCCGAGTGGA TAGCGTGGCA TCTGGAGGCT GCGGGCTATC GCGTCCTGTG TGAGGCGTGG
GACGCCGTGG CCGGATCGAG CCGCGATCGC CTCATCGACG ACGCCGTCGG CAGGTCGGTG
AGGACGCTCG CCGTGCTGTC GGCGGCATAT CTTCGCGCAC CTTCGGTGCA GAGCGAATGG
CGCGCAGCGT GGCGCAGGGA TCCCGATGGG CTGACCCGCA GGCTGATTCC GGTCAGGATC
GATGCCTGTG AACCGGAAGG TCTGCTTGGC GGCGTCGTTC CCATCGACCT GTTCGGACTG
GACGAGGCCG GCGCGCTGGC GCGTCTGCAG ACCCGGATAG ACGGCGCCCG CGCGGGTCGG
CAGAAGCCCG CTGGTGAGCC ACCCTTCCCC GCCCGCGGCG GGAACGGCCG CCCGGACGGG
TCGCAGCCCA CGCCTCGTCC GCCGGCTCCG CAACCAGCTC GTCCAGGACA TGAAACACCA
GCCCTGCTGC CCAGGGACAT CCGGGACTTC ACAGGTCGCA GGCGCGAGAT CGCCGAGCTG
GACGCGATGC TCGGGCCGGG CACATCGACG ATGGTCATTT CCGCGGTGGA CGGGACGGCC
GGCGTCGGGA AAACCACGCT CGCCGTCCAC TGGGGCCACC GGGTCAAGGA CCGGTTCCCG
GACGGCCAGA TCTACCTCGA CCTGCGAGGG TACGCACCGA CACTGCCGAT GCAGCCGCGC
CGGGCGCTCG GCCTGCTGCT CGGCGCCCTC GGCGTCACGG ATGCCGACAT CCCGATGACG
CTGGAGGCGC GCTCGTTGCT GTACCGGCGG CTGACGGGAA GCCGCCGCAC GCTGCTGGTC
CTGGACAACG CCCTCGACGT GGAACACGTT CGGCCGCTTC TGCCCGCCGG CACCTGCCTC
GCGTTGATCA CCAGTCGGAG CAGACTGAGC GGGCTGGCGG TACGCGACGG AGCCCGCCTG
ATCTCGCTGG ACGTCCTCAG CCCGGACGAG TCGACCGCAC TGCTGCGGCA GGTTCTGGGC
GAGGTCCGCG TGGAACGCGA GCTCGCGGCG GCCGAGCGGC TCGCGCAGCT GTGTGGACAC
CTTCCGCTGG CGCTGCGGAT CGCGGCGGTC CGACTGCTGA CGAGGCCTGG CTTCGACATC
GCGGACGCCG TCTTCGAGCT GGCCGACGAA AACGCGCGGA TCGACGTGCT ATCGCGGGAC
TCCGACGAGC ACGCCGCCGT TCGTAGCGTG TTCTCCTGGT CCTATCTGAG GCTGCGCCCG
GTCGAGCAGC GAGTGTTCCG GCTCCTTGGT CTCCACCGGG GCCCATCGCT GAGCGACGAC
GCCGTCGCGG CGCTTGTCGG CGTGCCCACG GCACAGGCCG GTGCGATGCT CCATACGCTC
GTCGCGCTGC ATCTGATCGA GGAGGAACGC CCTCGCCGCT ATCGCATCCA TGACCTGCTC
CGGCTGTACG CGGGCGAGCT GTGCACCCGG GACGAGCCCG CGCAGCAGCG GCGGGCTGCC
GTCGAGCGCA TCCTCTACTG GTACGTGGCC ACCTGCACCG CCGCCGCGAC CCTCGTGGAG
GGGCGGACGT CGGACAAGTC GGCGCCGCTC GCGCCACATG CCGTGGTGCG GCCGCTCGTG
TTCGGCTCGG CCAGCCAAGC GCTCGCCTGG TTCGACGACG AGTACGCCAC GATGCTCGAC
ATCGCCAACC ACGCGTACAG CAACGAACTC GACGACCTCT GCGGTCGGTT GGCGCTGGCG
GCGTGGCCCT TCTTCCAGCG CCGCAGCCGT TGGGCGGACT GGATCGAGTT GCAGCAGGTC
AGCCTGGCGG GCGCGCGGCG ATCCGGCGAC GAGCAGACCG AGGCCTGGCT GCTCGGCGGC
CTCGGCGACG TTCTCGACGA CCAGGAGCGG TACGAAGAGG CCCTGGAATG CCATCAGCGG
GCCATCGACA TCCATCGCCG GCTCGGGAAC CCGAAGGGTG AGGCCGTCGC GCGCAACAAC
CTCGCGGTCA GCCTCGACAA CCTTGAGCGG TACCCGGAGG CGATCGAGCA CTACACCGCC
GTGCTGGCGA TGTTCCGCGA CCTCGGCGAT CTCGCCAACG TCGGAATGGT CCTGAACAAT
CTCGGCGCCG CGCACTTCAT GATTGACCAG TTCGATGCCG CGGAGCGTCA CTACCGGGAG
GCTCTGGAGA TCCGCCAGGC CCTCGAAGAC GCCTTCGGCG AGGGCATGAC ACTGCACAAC
CTCGCCGACG TCGCGGAGGC GCTGGACCGC CTCGATGAGG CACGTGACTG GTACGAGCGC
TCCATTCCCC GGCACAGGGC GGCCGGGCAC CTGCGCGGCG AGGCACGGGC ACTGCACTTC
CTGGGCCGGG TGAACCAGCG TCAGGGAGAC CTAGAGACCG CCCGCGCCCA CTGGCGCGCG
GCACTCGACA TCTTCGAGCG CGTCGGCGAT CCGGAAGCTG ACGACCTGCT GGCGCTGCTG
GCCGACACCA GGACGAGCAC GCCGGGCGCG TAG
 
Protein sequence
MGEEAGWDFF VSYAADDVEW AEWIAWHLEA AGYRVLCEAW DAVAGSSRDR LIDDAVGRSV 
RTLAVLSAAY LRAPSVQSEW RAAWRRDPDG LTRRLIPVRI DACEPEGLLG GVVPIDLFGL
DEAGALARLQ TRIDGARAGR QKPAGEPPFP ARGGNGRPDG SQPTPRPPAP QPARPGHETP
ALLPRDIRDF TGRRREIAEL DAMLGPGTST MVISAVDGTA GVGKTTLAVH WGHRVKDRFP
DGQIYLDLRG YAPTLPMQPR RALGLLLGAL GVTDADIPMT LEARSLLYRR LTGSRRTLLV
LDNALDVEHV RPLLPAGTCL ALITSRSRLS GLAVRDGARL ISLDVLSPDE STALLRQVLG
EVRVERELAA AERLAQLCGH LPLALRIAAV RLLTRPGFDI ADAVFELADE NARIDVLSRD
SDEHAAVRSV FSWSYLRLRP VEQRVFRLLG LHRGPSLSDD AVAALVGVPT AQAGAMLHTL
VALHLIEEER PRRYRIHDLL RLYAGELCTR DEPAQQRRAA VERILYWYVA TCTAAATLVE
GRTSDKSAPL APHAVVRPLV FGSASQALAW FDDEYATMLD IANHAYSNEL DDLCGRLALA
AWPFFQRRSR WADWIELQQV SLAGARRSGD EQTEAWLLGG LGDVLDDQER YEEALECHQR
AIDIHRRLGN PKGEAVARNN LAVSLDNLER YPEAIEHYTA VLAMFRDLGD LANVGMVLNN
LGAAHFMIDQ FDAAERHYRE ALEIRQALED AFGEGMTLHN LADVAEALDR LDEARDWYER
SIPRHRAAGH LRGEARALHF LGRVNQRQGD LETARAHWRA ALDIFERVGD PEADDLLALL
ADTRTSTPGA
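
The relationship between the gene sequence and the protein sequence can be spot-checked by translating the first and last lines of the gene sequence. This is a minimal sketch, not the database's own pipeline: it builds the codon table by hand (translation table 11 assigns the same amino acids to codons as the standard code and differs in the permitted start codons), and the `translate` helper is a hypothetical name introduced here for illustration.

```python
from itertools import product

# Codon order follows the usual T, C, A, G layout with the third base
# varying fastest; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 33 bases of the gene sequence listed above.
first = translate("ATGGGCGAGG AAGCCGGCTG GGACTTCTTC")
last = translate("GCCGACACCA GGACGAGCAC GCCGGGCGCG TAG")
print(first)  # MGEEAGWDFF
print(last)   # ADTRTSTPGA
```

The two fragments reproduce the first and last ten residues of the protein sequence. The last line of the gene sequence starts in frame because the 42 preceding lines hold 60 bases each, and 2520 is a multiple of 3.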