Gene Francci3_1472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1472 
Symbol 
ID3903109 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1763580 
End bp1764695 
Gene Length1116 bp 
Protein Length371 aa 
Translation table11 
GC content72% 
IMG OID637878810 
Productdaunorubicin resistance ABC transporter ATP-binding subunit 
Protein accessionYP_480578 
Protein GI86740178 
COG category[V] Defense mechanisms 
COG ID[COG1131] ABC-type multidrug transport system, ATPase component 
TIGRFAM ID[TIGR01188] daunorubicin resistance ABC transporter ATP-binding subunit 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.487769 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.269585 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACAGACA ACCCGACGGT CCTCACCGAG GGCCTGCAGA AGAGCTACGG CGGCAAACGA 
GCGCTGGTGG GCCTGGACCT GCGGGTCGAT GCCGGCACCG TGCACGGTGT GCTGGGGCCC
AACGGAGCCG GCAAGACGAC GGCGGTACGC ATCCTGTCGA CCCTGCTGCG CCCGGACGGC
GGCCAGGCCA GGGTCCTCGG TCTGGACGTC GTCCGGCAGG CGGACGCGCT GCGTGCCCGG
ATCGGTCTGA CCGGCCAGTA CGCGGCGGTC GACGAGCGGC TCACCGGCCT GGAGAACCTG
GAGATGTTCG GGCGGCTCTA CCGTCTGCGC AGCCGCGTCG CGCGGGCCCG CGCGGTCGAG
CTGCTCGAGC GGTTCGAGCT CGCCGAGTCG GCCGGTCGGC AGGCGCGTAC CTACTCGGGC
GGGATGCGGC GGCGGCTCGA TCTCGCGGCC AGTCTGATCA TCAACCCGGC GGTGCTGTTC
CTCGACGAGC CGACCACCGG ACTGGATCCC CGCAGCCGCT CGGCGATGTG GAGGGTGATC
GCCGATCTGG TGCGGGACGG CACGACGGTC CTGCTCACGA CGCAGTACCT GGAGGAGGCC
GACCGGCTGG CCCACCGGAT CTCGGTGGTC GACGGCGGCC GGGTCATCGC GCGGGGTACC
CCGAACGAGC TGAAGTCGCA GGTCGGTGGC GACCGGCTGG ACATCGTCGT CGGTGGTGAC
GCCGCCGAGG ATGTCCAGGC CGCCCGGGCG GTGCTCGCCC GGTTCGGATC CGGCGCGGCG
ACGGCCGACG GGGAGACGCG TCGGGTCAGT GTGCCGGTAC CGGGCGCGAG CGTGCTGTCC
GATGTGGTCC GGGGTCTCGA CGAGCTGGGG GTGCCGATTT CCGACGTCGC CCTGCACCGG
CCGTCGCTGG ACGACGTCTT CCTCGCTATG ACCGGGCGCG CTTCGGCCGG CAGCGAGACG
CCGCCCGCAC GTGCGGGCGG CGACGGCCGT GGCGACACCG GCAACCCTGA TGACCGGGAC
GACCGGGACG ACCGGGACGA CCGGGACGAC CGGGATGACA AGGACGACCG GGATGACAAG
GATGACAAGG ATGACAAGGA CATGGTGACC ATATGA
 
Protein sequence
MTDNPTVLTE GLQKSYGGKR ALVGLDLRVD AGTVHGVLGP NGAGKTTAVR ILSTLLRPDG 
GQARVLGLDV VRQADALRAR IGLTGQYAAV DERLTGLENL EMFGRLYRLR SRVARARAVE
LLERFELAES AGRQARTYSG GMRRRLDLAA SLIINPAVLF LDEPTTGLDP RSRSAMWRVI
ADLVRDGTTV LLTTQYLEEA DRLAHRISVV DGGRVIARGT PNELKSQVGG DRLDIVVGGD
AAEDVQAARA VLARFGSGAA TADGETRRVS VPVPGASVLS DVVRGLDELG VPISDVALHR
PSLDDVFLAM TGRASAGSET PPARAGGDGR GDTGNPDDRD DRDDRDDRDD RDDKDDRDDK
DDKDDKDMVT I