Gene Franean1_1666 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1666 
Symbol 
ID5670068 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1992942 
End bp1994939 
Gene Length1998 bp 
Protein Length665 aa 
Translation table11 
GC content72% 
IMG OID641240584 
ProductABC transporter related 
Protein accessionYP_001506010 
Protein GI158313502 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.600713 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCTGGCC CGTGTCTGTC GGGCGGGCCG GTGGACGTCC CGTCGACCAG GGTGACCCGC 
CTGCACGCCC GGGTGGTGGC GAAGCTCACC GGCGTCGACG AGTCCGCCCG CAGCCTGCGC
TTCTTCGCCG ACCAGGGCCG GGTGATCACC ACCCTGGTCG GCCTGTCGGT CATCGGCTCG
GTCGCGGTCG CGGCCGGCCC GTTCCTGCTG CGCCACCTCA TCGACGACTC CCTGCCGACC
GGACAGGTCG GCGACCTCGT CGTCCCGGTG CTCCTCCTGT GCGCGCTGCT GGTGTTCGAG
TCGGCAGTGC TCGCCACCCG GATGGCGCTC ATCGCCCGGC TCGGCGCCGT GATCACGGTG
CGGACCCGGC AGGCCGTCAA CGCCCACCTG CAACGCCTGC CGTTCGGCTT CTTCCCCCGC
AGCCAGCAGG GCGAGGTGAT GACGGTGATG TCCACCGACG TCATCACCGC CCAGTACGCG
ATCTCCGCGG TCGTCCAGGC GGTGATCTGC CGGGTCGCTG ACATCGCCGT CGGCACCGCC
GTCGTCTTCG TGCTGGACTG GCGGCTCAGC CTGGCGGTCA TGGTCTTCGC CCCCGCCACA
CTGCTGATCA TGAAGTCCGG CCGCCGCCGC CTCGCCGGCA TCTCCCACCG CCAACGAGAG
CTGGACGGCC AGCTCATGGC CCAGGTGGCG GACACCTCCT CGGTCTCGGG TGCGCTGCAC
GTCCGGCTGT TCGACCGGGC GGACCATGAA TGCGAACGCT TCGACGCCGC CGCCGCCGAG
CTGCTCGCCG CCAGCCGTGA GCAGGCCCGG CTGACCTCGC GTGTCCGCCT CGTCGTCAAC
CTCGGCATCG TCATGACGAT GACGGTGGTC GTTACCCTCG GCGCCTGGCT GGTCTCCACG
GGACACACCT CGCTCGGGAC GGTCGCCGCG CTCGGCGGCG CACTCCTGGT CTCCTTCGGC
CCGCTCGCCT CCGCCGTCGA GCTCCGCTCC GAGCTCTCCA GCGCCGGCGC GTCATTCCGC
CGGATCTTCG CCCTGCTCGA CCTCCCAGTG GCCGCGGCCC GCCCGGTGGC CACAGCCCGC
CCCGTCGACC CTGCACGCCC GCCGCTCCAG CAGCGCCCGC CGGCCCAGCT GCGCACGGTG
GACCGACAGC CTGCGACGGT CGAGCACCGT CCGCGGGTCC CCGTCGCCTC GGTGCCGGAC
CTCACCCTTG ACGACGTCTG GTTCTCCTAC GACATCGCCA CCTCCGACCC ACGCCTCACC
GCCCCGGATC GCCCCGCCGC GCTGGATCGC CCCACCACAC CGGATCGCGA CGACGCGGCG
TGGAACCTGC GCGGTGTCAC CCTGCGGGTT GCTCCGGGGG CGACGACCGC GGTCGTCGGG
GCGAGCGGCG CCGGCAAGAC GACCATCACC TACATCGCCA GCGGCATCCA CCGCCCCCAG
CACGGCACCA TCCGGCTCGG CGGCGTGGCC TTGGACGACA TTCCGTCATC CGAGCTGCAT
TCGCTGATCG GGGTGGTTCC CCAGGACCCG CACCTGTTCC ACGACACCAT CGCGGCGAAC
CTGCGCTACG GACGGCTCGA CGCCACTGAC GACGAGCTGC GCGACGCGCT CGACGCCGCC
AACCTCTCGG CGCTGCTCAA CCAGCTGCCG GACGGGCTGC GCACCCGTGT CGGCGCTCGC
GGCTACCGGC TTTCCGGCGG CGAGCGTCAA CGGCTCGCCA TCGCCCGGGT CCTGCTGCAG
TCGCCCCAGG TGCTGATCCT GGACGAGGCC ACCTCGGCGC TGGACACCGT CTCCGAGGCG
GCCGTCCGCG CCGCGCTCGA CCGGCTCTCG GTCGGCCGGA CCTGCCTGGT CATCGCGCAC
CGGCTCTCCA CCGTGATCGA TGCCGACCGC ATCTACGTCA TGGACCACGG CCGGGTGGTC
GAGGACGGCA CCCACGGCGA GCTGCTCGCC GATGATGGCG CCTACGCCCG CCTGTACCGC
CCGGTCACGG TGGGCTGA
 
Protein sequence
MSGPCLSGGP VDVPSTRVTR LHARVVAKLT GVDESARSLR FFADQGRVIT TLVGLSVIGS 
VAVAAGPFLL RHLIDDSLPT GQVGDLVVPV LLLCALLVFE SAVLATRMAL IARLGAVITV
RTRQAVNAHL QRLPFGFFPR SQQGEVMTVM STDVITAQYA ISAVVQAVIC RVADIAVGTA
VVFVLDWRLS LAVMVFAPAT LLIMKSGRRR LAGISHRQRE LDGQLMAQVA DTSSVSGALH
VRLFDRADHE CERFDAAAAE LLAASREQAR LTSRVRLVVN LGIVMTMTVV VTLGAWLVST
GHTSLGTVAA LGGALLVSFG PLASAVELRS ELSSAGASFR RIFALLDLPV AAARPVATAR
PVDPARPPLQ QRPPAQLRTV DRQPATVEHR PRVPVASVPD LTLDDVWFSY DIATSDPRLT
APDRPAALDR PTTPDRDDAA WNLRGVTLRV APGATTAVVG ASGAGKTTIT YIASGIHRPQ
HGTIRLGGVA LDDIPSSELH SLIGVVPQDP HLFHDTIAAN LRYGRLDATD DELRDALDAA
NLSALLNQLP DGLRTRVGAR GYRLSGGERQ RLAIARVLLQ SPQVLILDEA TSALDTVSEA
AVRAALDRLS VGRTCLVIAH RLSTVIDADR IYVMDHGRVV EDGTHGELLA DDGAYARLYR
PVTVG