Gene Franean1_6412 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6412 
Symbol 
ID5674727 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7784650 
End bp7786599 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content71% 
IMG OID641245260 
ProductABC transporter related 
Protein accessionYP_001510655 
Protein GI158318147 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.443999 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCGCC CACCCGCGGG CGGGGCCCGC CCCATGGCAG GCCCCGGTCG GTTCTTCGCC 
GCCGGGGGCG AGAAGCCCGA GGATTTCGCG GCCTCGACCC GGCGGGTGCT CCGCCTGCTG
GGCCGGCAGC GCGCGTTGCT GATCCCCGCG GTCGCGCTGG CGGTGGTCGG GATCGCGCTC
ACCGTCACCG GGCCGCGGCT GCTCGGGCAC GCCACCGACC TGGTGTTCGC CGGCCTGCTC
AGCTCCCGGC TGCCCGCCGG GACGACGAAG GCCGAGGCCG TCGCGCGGCT GCGCGCCGAG
GGCCACGGCA CGCAGGCCGA CCTGCTGTCC TCCGTCGACG TCACCCCCGG TGAGGGGATG
GACTTCGGCG CGATCGGCCG GGTGCTGCTG GTGGTGCTGA TCGTCTACCT CGTCGCCGGC
CTCTGCACGG TGCTGCAGGC CCGGCTGGCG AACAAGGCCC TGCAACGGAT GCTGAGCGAC
CTGCGGGCCG ATGTGCAGGA GAAGATCACC CGCCTGCCGC TGCGGTACTT CGACCGCCGC
CAGCGCGGCG AGGTGCTCAG CCGGGTCACC AACGACATCG ACAACCTCGG GCAGAACCTG
CAGCAGAGCC TGTCCCAGAT GATCGCCTCG GTGCTGACCA TCATCGGCGT GCTGGCCATG
ATGATCTGGA TCTCCTGGAT CCTCGCACTG ATCGCCGTCG TCACGGTGCC GGTGTCGATC
GTGATCACCA CCCGGATCGG CAAGTTCGCC CAGCCCCAGT TCGTCAGCCA GTGGAAGACG
ACCGGCCGGC TCAACGGCCA CATCGAGGAG ATGTACACCG GGCACGCCCT GGTGCGCGCG
TTCGGCCGGC AGGAGGAATC CGCCGAGATC TTCCGGGAGC ACAACGAGCG CCTCTACGAG
GCGAGCTGGC GGGCCCAGTC CATCTCCGGT CTCATGCAGC CGGCGATGAT GTTCATCGGC
AACCTGAACT ACGTGCTGGT GGCCGTCGTC GGTGGCCTTC GGGTGGCCTC CGGCGCGCTG
TCGATCGGCG ACGTCCAGGC GTTCATCCAG TACTCGCGAC AGTTCAGCCA GCCGCTGACG
CAGCTCGCCA GCATGGGGAA CATGGTGCAG TCCGGGATCG CGTCGGCGGA GCGGGTGTTC
GACCTGCTGG ACGCGCCCGA GCAGGAGCCC GACCCGCTGG CGCCGGCCCG CCCGGAGGAG
AACCACGGCC GGGTCGCCTT CGAGCACGTC GCGTTCCGCT ACGAGCCGGA CAAGCCCCTC
ATCGACGACC TGTCGCTGGT TGCCGAGCCG GGGCACACCG TGGCGATCGT CGGGCCCACC
GGCGCCGGAA AGACCACACT CATCAACCTG CTGATGCGCT TCTACGAGGT CACCGAGGGC
CGGATCACCC TGGACGGCGT CGACATCGCC GAGATGTCGC GGGCGGACCT GCGGCGATCC
ATCGGGATGG TGCTGCAGGA CACCTGGCTG TTCGGCGGCA CCATCGCGGA GAACATCGCC
TACGGCGCCG AGGGCGCCAC CCGCGAACAG GTCGTCGAGG CGGCCCGCGC CGCGCACGTC
GACCGGTTCG TGCGCACGCT CCCCGACGGC TACGACACCG TGCTCGAAGA CGAGGGCGTC
GGGGTCAGCG GCGGCGAGAA GCAGCTCATC ACCATCGCGC GCGCGTTCCT CGCCGAGCCG
CTGATCCTCG TCCTGGACGA GGCGACGAGC TCGGTGGACA CCCGCACCGA GGTGCTGATC
CAGCGGGCGA TGTCCCGGCT GCGCGCCGGC CGCACGGCCT TCGTGATCGC CCACCGGCTG
TCCACCATCC GGGACGCCGA CACGATCCTC GTCATGGAGG ACGGCGCGAT CGTCGAGCAG
GGCGACCACG ACGCCCTGCT GGAGGCGGAC GGCGCCTACG CGCGGCTGTA CAAGGCCCAG
TTCGCCCAGG CCGTCGTCGA GACCGGCTGA
 
Protein sequence
MTRPPAGGAR PMAGPGRFFA AGGEKPEDFA ASTRRVLRLL GRQRALLIPA VALAVVGIAL 
TVTGPRLLGH ATDLVFAGLL SSRLPAGTTK AEAVARLRAE GHGTQADLLS SVDVTPGEGM
DFGAIGRVLL VVLIVYLVAG LCTVLQARLA NKALQRMLSD LRADVQEKIT RLPLRYFDRR
QRGEVLSRVT NDIDNLGQNL QQSLSQMIAS VLTIIGVLAM MIWISWILAL IAVVTVPVSI
VITTRIGKFA QPQFVSQWKT TGRLNGHIEE MYTGHALVRA FGRQEESAEI FREHNERLYE
ASWRAQSISG LMQPAMMFIG NLNYVLVAVV GGLRVASGAL SIGDVQAFIQ YSRQFSQPLT
QLASMGNMVQ SGIASAERVF DLLDAPEQEP DPLAPARPEE NHGRVAFEHV AFRYEPDKPL
IDDLSLVAEP GHTVAIVGPT GAGKTTLINL LMRFYEVTEG RITLDGVDIA EMSRADLRRS
IGMVLQDTWL FGGTIAENIA YGAEGATREQ VVEAARAAHV DRFVRTLPDG YDTVLEDEGV
GVSGGEKQLI TIARAFLAEP LILVLDEATS SVDTRTEVLI QRAMSRLRAG RTAFVIAHRL
STIRDADTIL VMEDGAIVEQ GDHDALLEAD GAYARLYKAQ FAQAVVETG