Gene Franean1_7033 details


Gene Information

Locus tag: Franean1_7033
Symbol:
ID: 5675344
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8580940
End bp: 8583564
Gene Length: 2625 bp
Protein Length: 874 aa
Translation table: 11
GC content: 72%
IMG OID: 641245879
Product: hypothetical protein
Protein accession: YP_001511270
Protein GI: 158318762
COG category: [S] Function unknown
COG ID: [COG1289] Predicted membrane protein
TIGRFAM ID:
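
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values copied from the record above.
length = gene_length(8580940, 8583564)
print(length)                  # 2625
print(protein_length(length))  # 874
```

Both results agree with the Gene Length and Protein Length fields listed in the record.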


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGGAACG GATGGCTGGG ACGGCTCGTT CGGGCGTTCT CCGACGATCC CGGGCTGATC 
AACCTCAAGG CGGCCGTCCG GGTCGCCGCC GTCGCCCCCA TGTTGCTGGC GGTCACCTAT
CTGGTCGGTG GGGACATGCG GCTGAGTCTG TTCGCCTGGT TCGGCTCCTA CGTCCTGTTG
GAGTTCGTCG AGTTCACCGG GCCGGTCAGT ACCAGGCTGG TGGCCTATGT CGCGTTCGCG
TTCGGCGCCG TCCCGCTTCT CGTCGTCGGG ACGCTGTGTT CCCGGACGGC CTGGCTGGCG
GTACCGGCAA CTGCATTGGT TGCCTGGGTC GTGCTGTTCT CCGGGGTCCT CAACGCGTAT
TTCGCCATCG CCGGACGGGC CACGCTGATG GCCTTTGTCC TGGCAGTCAT GACTCCCGGC
CCTGTGTCGG CCATTCCGGA ACGGCTTGCC GGCTGGGGGG TCGCCATGGT GACGGCCGTC
ACCGCGGCCA TGATGCTGTG GCCGGAGCGG CCACCGACTC GGCTGCGTGC CGCGCTGGCC
CAGGCGTGCC GGTCGATGGC CGCCGGGGTG GCGTGGACGA CGATGCCTTC TGAGGGGTCC
GGCGGGACCG GGGCGGAGGA AGTCCACCGG ATCTGGACCG ATGTGTTCCA TCTGCGCCGC
AGGTTCGTCG AGACCGCGCA CCGGCCCGGC GGTGTCGGCG GGCGCACCGC GGCGTCGGGG
CATCTGGTCG TGGATGTCAA CTGGTTGGTC CCGTTCGCTT TACCGCGCGC GGACCGGGAC
GGGATCGCTG CGGTCTGCTT CCGCCGGGAG GCGGCCGAGC TCCACGCCGC CGCCGCTGCC
ACGCTCACCG CCGCGGCCGA TCGACTCGAC CGCGATCCAC AGACGAGCGA GCGGCTCGGT
CTGGACCGCC TGGAGCGGGC TGAGCGGGGG ATGCGCACAG CCCTGCTGGA CTACGTGGAG
TGCTCAGACA ACCGGCCGGC GCCTGACGTT ACTGTGCCGA TCGCCCGATC GGCCAATTCT
GGTGTCGCGG GGTCGGAGCA GCCACCGGCG GACCTCGCCA TCGCCGAGGC GTTCCGGCTG
CGCAGGCTGG CCCGGGGTAC TCGGGAGCTG GCGGTGAACG TGCTGTGGGT GACCGCGCCG
GCGCCGACGC TGCGTTCCTG GATACGGCCG CGTGGCTGGT CGGCCAGGCT GAGGCGGCTC
GCCCGCCGCG GCGTCGCCGC CACCGATCTG GCCGCCGGAT ACGTCAGCTT GCGGTCGGTG
TGGTTCCGCA ACAGTGTGCG CGGCGCCGTG GGGCTCGCCC TCGCTGTCGC TGTGGCCCAG
AACATCGAGG TGCAGCATGG CCTCTGGGTC GTGCTCGGCA CGCTGTCGGT CCTGCGGTCG
AACGTGGCGG CGACCAGCTC GACGATCCTG CGGGGACTGC TGGGCACCGG GGTCGGTATC
GTGGTGGGTG GCCTGTTCGT CGCCCTGGTC GGCGCCCATA CCGTGGTGTT GTGGCTGGTG
CTGCCGCTGG CCATGTTCGT CGCCGGCTAT CTGCGGCGCA GACCGTCCTT CGCGCTGGGG
CAGGCAGGCT TCACCGTCGC CATCCTCATC CTTTTCGACA TCATGGAGCC GTCGGGCTGG
CAAGTGGGTC TGGTTCGCAT CCAGGACGTG ATGATCGGTT TCGGGGTGAG CCTGGCTGTG
GGAGCGCTGC TGTGGCCCCG GGGAGCGGCG GCGGTGATCC GGCGCCGGGC CGCCGCCGCT
TACCGGACGG GCGCGGACCT CCTCGCGCTG GTCGTCATGC GGGCACCCGG TGACGACGAC
CCGCCCAGCG CCCGGCCGCG GACCTCGCCA TGGCTGTACC CGGCCTACAC CGGAACATCG
GGCTATCCGG GCAGCACCGC CGCCGCACCG GTGAGCCGGG TCGACACCCC TCGCGCAGAG
AACGCTGACA CCACAGGCCC GGCCGACGGG GCCGTCCGCC GTCCCCGTCG GGACGAGGAC
GTCACCGGCG CGGCGCGCGA CGCCATCCGC GCCGGGCGGC TGCTCGACGA CGTGGTCCGC
CAGTACCTGT CGGAACAGTC CAACGACCGG GTCGACGTCG ACGCGCTCAT GACCATCGTT
GGCGGGGCAC TGCGGCTGCG CCGGACCGCA CAGCTGCTGC GGGCCGGCGA CGTGCCCTGG
CCAGCGGACA TCCTGCGGAC CGGGGGCGAT GTCAGCGCGA CCGCCTCGAC CATGGAGATC
TCCGCCGCGC TGCCCGATCT GGCCGCCGCT CAGGAGGCTG TCACAGTGGA GACGGCCGAG
CTGTGCGACT GGTACCGGCG ATTCGCCGAG GCACTCGATG ACGGCAGGCC ACCACCAGCC
GACGCGAACC CTGGTCCAGG CCCGGCCAGC ACGGCGGCAC TTGCCGTGGT GCGTCACGGC
GCCGCGGCGC ACCGACGCCC GGAACTGCGT GCCGGCGTGG CGCTGGCAGC ACGCGCGACC
TACCTCGACA TCCTGCGGGA TCTGCGACCG ATGCTCACCG ACGCGGGTAC GGCGCTCGCT
CACCGGCCGG GCCGACCCTG GGCTAGCCGG CGGAGGACAG CCGAAGGGAG GGGAGCGGCC
GCCAGACCCA GGCCCGCCCC GAACGTTCAG CGTGCCGGCG GGTGA
 
Protein sequence
MRNGWLGRLV RAFSDDPGLI NLKAAVRVAA VAPMLLAVTY LVGGDMRLSL FAWFGSYVLL 
EFVEFTGPVS TRLVAYVAFA FGAVPLLVVG TLCSRTAWLA VPATALVAWV VLFSGVLNAY
FAIAGRATLM AFVLAVMTPG PVSAIPERLA GWGVAMVTAV TAAMMLWPER PPTRLRAALA
QACRSMAAGV AWTTMPSEGS GGTGAEEVHR IWTDVFHLRR RFVETAHRPG GVGGRTAASG
HLVVDVNWLV PFALPRADRD GIAAVCFRRE AAELHAAAAA TLTAAADRLD RDPQTSERLG
LDRLERAERG MRTALLDYVE CSDNRPAPDV TVPIARSANS GVAGSEQPPA DLAIAEAFRL
RRLARGTREL AVNVLWVTAP APTLRSWIRP RGWSARLRRL ARRGVAATDL AAGYVSLRSV
WFRNSVRGAV GLALAVAVAQ NIEVQHGLWV VLGTLSVLRS NVAATSSTIL RGLLGTGVGI
VVGGLFVALV GAHTVVLWLV LPLAMFVAGY LRRRPSFALG QAGFTVAILI LFDIMEPSGW
QVGLVRIQDV MIGFGVSLAV GALLWPRGAA AVIRRRAAAA YRTGADLLAL VVMRAPGDDD
PPSARPRTSP WLYPAYTGTS GYPGSTAAAP VSRVDTPRAE NADTTGPADG AVRRPRRDED
VTGAARDAIR AGRLLDDVVR QYLSEQSNDR VDVDALMTIV GGALRLRRTA QLLRAGDVPW
PADILRTGGD VSATASTMEI SAALPDLAAA QEAVTVETAE LCDWYRRFAE ALDDGRPPPA
DANPGPGPAS TAALAVVRHG AAAHRRPELR AGVALAARAT YLDILRDLRP MLTDAGTALA
HRPGRPWASR RRTAEGRGAA ARPRPAPNVQ RAGG
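
The sequences above are laid out in space-separated blocks of 10 characters, 60 per line, so they need to be cleaned before any analysis. The sketch below shows one way to strip that layout and compute a GC fraction. It is illustrative only: the helper names are hypothetical, and the sample input is just the first line of the gene sequence, so its GC fraction is not expected to match the 72% reported for the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from the record, copied verbatim.
first_line = (
    "ATGCGGAACG GATGGCTGGG ACGGCTCGTT CGGGCGTTCT CCGACGATCC CGGGCTGATC "
)

seq = clean_sequence(first_line)
print(len(seq))               # number of bases after cleaning
print(seq.startswith("ATG"))  # whether the fragment begins with a start codon
print(round(gc_fraction(seq), 3))
```

Passing the full gene sequence (all lines joined) through `clean_sequence` would give a string whose length can be compared against the Gene Length field.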