Gene Francci3_0379 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0379 
Symbol 
ID3903430 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp448713 
End bp450674 
Gene Length1962 bp 
Protein Length653 aa 
Translation table11 
GC content67% 
IMG OID637877708 
ProductABC transporter related 
Protein accessionYP_479495 
Protein GI86739095 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCC CGGTACCGGC ACCGCGCCGC GGCCCGGCCC CGACGGCCGG GCCCGCTCGG 
TTCTTCGTGG GCCCGGGCGC GACGGAGAAG TCGATGGACT TCGCCGCGTC GAGCCGCCGG
CTGCTCGGAC TGCTGCGACC GCAGCGCCAC CTGCTGATCG CCGTCGTGGT CCTGGCGACG
TTGAGCGTCG GCCTGAGCGT GATCGGTCCT CGCCTGCTCG GCCACGCCAC CGACCTGGTG
TTCGCGGGGA TCTTCAGTCG GCGGCTGCCA GCCGGCAGCA CGAAGGAACA GGCGGTCGCG
CAGCTGCGTG CCACCGGCCA CGGTACCCAG GCGGACCTGC TCGCCTCGCT CGACGTGACC
CCCGGTCATG GCATGGACTT CGATGCCATC GGTGTCGTGC TGCTGGGGGT CGTGGTCGTC
TACGCGGTGT CCGGATTGTG CGGTGTGCTC CAGGCCCGGC TGGCGAACCT GGCCCTGCAG
AGAGTGATCA GTGATCTGCG CCGCGACGTC CAGGCGAAGA TCTCGCGGTT GCCCGTCCGT
TACTTCGACG GTCAGCCGCG AGGCGAGGTG CTCAGCCGGG TCACCAACGA CATCGACAAC
TTCGGGCAGA GCCTGCAGCA GAGCATGTCG CAGATTGTCG CGTCGACGCT GACGATCGTG
GGCGTGCTAT CGATGATGCT CTGGATCTCC TGGATCCTCG CCCTGATCGC GCTGGTCACG
GTGCCAGTGT CGATCATGAT GGCGACCCGG GTCGGCAAGC TGGCTCAGCC CCAGTTTGTC
GCCCAGTGGC GGATCACCGG TGGTCTCAAC GCCCATATCG AGGAGATGTA CACCGGGCAC
ACGCTCGTGC GGGCGTTCGG CCGGCAGGAG GAATCCGCGG AGATCTTTCG TGAGCAGAAC
GAGCGGCTCT ACGCCGCCAG TTGGCGCGCG CAGTTCATCT CCGGTCTCAT GCAGCCCTCG
ATGTTGTTCA TCGGAAACCT GAACTACGTG CTGGTAGCTG TGATCGGCGG CCTGCGGGTC
GCGTCCGGCT CGCTGTCGAT CGGTGACGTC CAGGCGTTCA TCCAATATTC TCGGCAGTTC
AGCCAGCCGT TGTCCCAGGT TGCCAGCATG GCGAACCTGG TGCAGTCCGG GGTGGCCTCC
GCGGAGCGGG TGTTCGACCT GCTCGACACC GTCGAGCAGG AACCCGACCG GGCGGTTCCG
GCGCATCCGG AGCGGCTTCG TGGCCGGGTG GCCTTCGAAC GCGTCTCCTT CCGCTACGAA
CCGAACAAGC CGTTGATCGA GGACCTGTCA CTTGTCGCGG AACCTGGTCA CACCGTCGCG
ATCGTCGGGC CGACCGGCGC CGGCAAGACC ACTCTGATCA ACCTGCTGAT GCGCTTCTAC
GAGGTCACCG GCGGACGGAT CACCTTGGAC GGGGTCGATG TCGCGGCGAT GTCGCGGGAT
GAACTGCGGG CCAGCATCGG CATGGTCCTG CAGGACACCT GGCTGTTCGG CGGAACGATC
GCGGAGAACA TCGCCTACGG TGCGGAGGGC GCCACCCACG AGCAGATTGT GGCGGCGGCC
CGCGCCGCGC ACGTCGACCG GTTCGTCCGC ACTCTTCCCG CGGGTTACGA CACCGTGCTC
GACGACGAGG GGGTCGGCGT GAGCGGCGGG GAGAAGCAGC TGATCACCAT CGCCCGCGCC
TTCCTCGTGG AACCTCTGAT CCTGGTTCTC GACGAGGCGA CGAGCTCGGT GGACACCCGT
ACCGAGGTCC TCATCCAACG GGCGATGTCG ACCCTGCGGG CGGGGCGGAC CGCGTTCGTC
ATCGCGCATC GGCTGTCGAC CATCCGGGAC GCGGACACCA TTCTCGTGAT GGAGAACGGA
GCCATCGTCG AACAGGGCAC CCACACCGAG CTCCTGGCCG CCGATGGCCC ATACTCCCGG
CTGTACCAGT CCCAGTTCGC CCAGGCCGTC GTCGAGATCT GA
 
Protein sequence
MSSPVPAPRR GPAPTAGPAR FFVGPGATEK SMDFAASSRR LLGLLRPQRH LLIAVVVLAT 
LSVGLSVIGP RLLGHATDLV FAGIFSRRLP AGSTKEQAVA QLRATGHGTQ ADLLASLDVT
PGHGMDFDAI GVVLLGVVVV YAVSGLCGVL QARLANLALQ RVISDLRRDV QAKISRLPVR
YFDGQPRGEV LSRVTNDIDN FGQSLQQSMS QIVASTLTIV GVLSMMLWIS WILALIALVT
VPVSIMMATR VGKLAQPQFV AQWRITGGLN AHIEEMYTGH TLVRAFGRQE ESAEIFREQN
ERLYAASWRA QFISGLMQPS MLFIGNLNYV LVAVIGGLRV ASGSLSIGDV QAFIQYSRQF
SQPLSQVASM ANLVQSGVAS AERVFDLLDT VEQEPDRAVP AHPERLRGRV AFERVSFRYE
PNKPLIEDLS LVAEPGHTVA IVGPTGAGKT TLINLLMRFY EVTGGRITLD GVDVAAMSRD
ELRASIGMVL QDTWLFGGTI AENIAYGAEG ATHEQIVAAA RAAHVDRFVR TLPAGYDTVL
DDEGVGVSGG EKQLITIARA FLVEPLILVL DEATSSVDTR TEVLIQRAMS TLRAGRTAFV
IAHRLSTIRD ADTILVMENG AIVEQGTHTE LLAADGPYSR LYQSQFAQAV VEI