Gene Franean1_0225 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0225 
Symbol 
ID5668650 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp273938 
End bp276175 
Gene Length2238 bp 
Protein Length745 aa 
Translation table11 
GC content73% 
IMG OID641239154 
ProductMMPL domain-containing protein 
Protein accessionYP_001504598 
Protein GI158312090 
COG category[R] General function prediction only 
COG ID[COG2409] Predicted drug exporters of the RND superfamily 
TIGRFAM ID[TIGR00833] Transport protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.196903 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.86442 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGCCTGC CGCGTCCCGG TGCCACGCTC CGCACGCGGC GGTTCCGCTG GGCGGTCGTC 
CTGGTGTGGC TGCTGATCGG CGTCGCCGTC AGCCCGCTCG CGTTGATGCT CACCGACGCC
GAGACCAACG ACGCGGCCGC CTTCCTCCCC GAGGGCGCCG AGTCGACCCG TCTGCTGGAG
GCCCAGCGGC AGCTCCCCGG CGGTGACGCC GTGCCGGCGG TGGTCGTCCT GGCCCGTGAC
GGCGGCCTGA CCAAGGCCGA CCTGGACGTC GCGACGACGT TGCGCACCGA ACTCGCCGGG
TTCGCGGCGA CGACGACGAT CCCGGGAGTG ATCCCCGCGC CGGATCACCG CGCGGTGATC
ATCACCGTCC CGATGCCGCA GACCACGGAC GCCGACCGTT TCACCGCCGA CGTAGCGCGC
ATGCGGACAA TCGCCAGGCA GGCCGCCGCC GCGGAACCAG GCCTGGATTC CGCGGTCACC
GGGCCCGCGG GCCTGGTCTC CGACACCTAC GACGTGTTCG TGAAGATCGA GGGCGCACTG
CTGCTCGTGA CCGCGTCGAT CGTGGCCGTG ATCCTGCTCG CGGTCTACCG AAGTCCGTTC
CTGTGGGTCG TGCCGCTGCT GTCCGTCGGC ATCGCCGACC AGACCGCGGC CGGACTGATC
TACCTTCTCG CCGAGCACGC CGGGCTCACG GTGAACGGCC AGAGCGCCGG AATGCTGCGG
GTGCTCGTCT TCGGCGCCGG CACCGACTAC GCGCTGCTGC TCATCTCCCG CTACCGCGAG
GAGCTGACCC GGCACGCCGA GCCGGCCACC GCGATGCTGG TCGCGCTGCG CCGGGCCGGC
CCGGCGATCC TGGCGTCGGC GGGCACAGTG ATCATCGGCA TGCTGTGCCT GCTGCTGGGC
GAGCTGAACT CGCACCGCGG GCTGGGCCCG GTCTGCGCGA TCGGGATCGC GGTCGCGCTG
GTCACCATGC TCACGCTGCT GCCGGCGGCG ATGGTGGTCG GCGGGCGACG GCTGTTCTGG
CCGTTCGTGC CACGTCTGGG CCAGGCGGAG CGGTCGGTGG AGGCCGGCGC GTGGGCGCGC
GCCGGCGCCG TGATCGGGCG CCGCCCGCGG ACGGTCTGGG TCACCACCGT GGCCGTGCTG
GGCGCGCTCG TCATCGGCAT GCTCGTCCTG CCCGGCGTCC TGCGCCAGGA CCAGGCCTTC
CGCGACAACG TCGAGTCGGT CCAGGGCCAG CGCCTGGCCG AGCGGAGCTT CCCGCCGGGC
GTCACCGCGC CCACGTTCGT GGTGGCGAAC AGCGCGCGCG CCGATGCCGT GGCCGAGGCG
GTCCGCGCGA CTCCCGGCGT GGCCGCGGCG GCGGAGAGCG GGCGGACGTC CGAGCTCGTC
CAGTTCCTGG TCGTGCTCGA CGCGCCGCCG GACAGTCCCG AGTCGTTCCG GACGGTGGAG
AAGCTGCGCA CGACCGTGCA CGCGCTGCCC GGCGCCGACG CCCTCGTCGG CGGCAACACC
GCCGTCAACC TGGACATCCG GGACGCGGCC GTGCGGGACC GCGAGCTGAT CATTCCGGTC
GTGCTCGTCG TGGTGCTGCT CATCCTCGGG CTGCTGCTGC GTTCCGTCGT CGCGCCGCTG
CTGCTCATGG GCACGGTCGT GCTGTCGTTC TTCGCCGCCC TCGGCGCGAG CGCGCTGGTC
TACCACTACG TCTTCGACTT CCCCGGCATC GATCCGGCGC TGCCGCTGAT CGGTTTCATC
TTCCTGGTGG CGCTCGGCGT CGACTACAAC ATCTTCCTGA TGACCCGGGT GAAGGAGGAG
ACCGAGCACA TCGGGCACGC CGCCGGTGTC CGCCGTGGCC TCGCGGTGAC GGGCGGGGTC
ATCACCTCGG CCGGGGTCGT GCTCGCCGCC ACGTTCGCCG TGCTGCTCAT CTTCCCGCTG
GTCCAGCTCG CCGAGGTCGG CTTCCTGGTC GCGTTCGGCG TTCTGCTCGA CACGCTTGTG
GTCCGCTCGG TGCTCGTGCC GGCGCTCGCA CTGGACGTCG GCCCCGTCGT GTGGTGGCCC
AGCCACCCGG AACGCGCCCG GCCCGCCGGC CCAGTCAACG GCCAGCTCAC CGACCAGCTC
ACCGACCACG AGACGCTCGC TGACTTCGGA GCGCTCGGTG TTCCCCTGAG CGCGATGGAG
CCCGCGGCCG CGGCCCTCGA GAGTGCCGAG GCACGAGCCC GGGCCGAGGA GGACGAAGAC
ACCGAAGCGA ACCGATAA
 
Protein sequence
MGLPRPGATL RTRRFRWAVV LVWLLIGVAV SPLALMLTDA ETNDAAAFLP EGAESTRLLE 
AQRQLPGGDA VPAVVVLARD GGLTKADLDV ATTLRTELAG FAATTTIPGV IPAPDHRAVI
ITVPMPQTTD ADRFTADVAR MRTIARQAAA AEPGLDSAVT GPAGLVSDTY DVFVKIEGAL
LLVTASIVAV ILLAVYRSPF LWVVPLLSVG IADQTAAGLI YLLAEHAGLT VNGQSAGMLR
VLVFGAGTDY ALLLISRYRE ELTRHAEPAT AMLVALRRAG PAILASAGTV IIGMLCLLLG
ELNSHRGLGP VCAIGIAVAL VTMLTLLPAA MVVGGRRLFW PFVPRLGQAE RSVEAGAWAR
AGAVIGRRPR TVWVTTVAVL GALVIGMLVL PGVLRQDQAF RDNVESVQGQ RLAERSFPPG
VTAPTFVVAN SARADAVAEA VRATPGVAAA AESGRTSELV QFLVVLDAPP DSPESFRTVE
KLRTTVHALP GADALVGGNT AVNLDIRDAA VRDRELIIPV VLVVVLLILG LLLRSVVAPL
LLMGTVVLSF FAALGASALV YHYVFDFPGI DPALPLIGFI FLVALGVDYN IFLMTRVKEE
TEHIGHAAGV RRGLAVTGGV ITSAGVVLAA TFAVLLIFPL VQLAEVGFLV AFGVLLDTLV
VRSVLVPALA LDVGPVVWWP SHPERARPAG PVNGQLTDQL TDHETLADFG ALGVPLSAME
PAAAALESAE ARARAEEDED TEANR