Gene Francci3_4242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4242 
Symbol 
ID3907208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5060834 
End bp5062768 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content64% 
IMG OID637881568 
ProductABC transporter related 
Protein accessionYP_483317 
Protein GI86742917 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.235556 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAGCG TGGCCGGCGG GGTTCCGCAT CAGGAGATAC GGTCGCCTGG CATCGGTTCC 
CGTGGCGCCG GTCAAAACGG CACCGGCCTG GACGATGTCG GCACCGCCGT GGACAACCTC
GTTTCCCGCC GGGTGGAGCG GCATGCCGCC CTGCGTTCCT ATCTGCGTTT CGTCGCGACC
TTCAAGGGGC CGGCTCTCCT GGTCGTCGGT GTCTTCGTCC TGTCCAACAG CCTGCTCGCC
GTGATACCGG TCCTCATCGG CCTTCTCGTG CAGGCTCTCG CCAGCGCCCC GGTACAGACC
CACGACGCGT ACCTGTACGC CGGCGTCCTT CTCGCGTGCA ACATCGGCCA TGACCTGACC
TGGCGTGGTG CCGAGGTGCT GTATCTCCGG CTACTCAACC ACCGGGGCTA CGAATACGAG
AACATTCTCT TTGGAAATGT CATAAACGAG CGATATCCAT ACTTCGTCGG CAAGTTCACC
GGAAAGATCA GCAGCTATGT GAGCACGCTC GGCCGCGAGT TCCGAGAGAT CCTCGAGACG
GCGTGCTTCA CGTACGTTGA ACAGCTGGTC AAGGTTCCGG CGATCCTAAT GATCATGTTT
TCGGTTAACC TTTACTCGGG CCTGACCTTC CTGGGTTCCG TCGCGATCAT GCTTCTTGTT
GGCAGGCACA CCGTCCGTCG TTCGGCCCGC GCGGAGAAGC GGCTCGCGGA CGAGGTCTCG
GAGATGGACG GTTACGTGAT CGACGTCATC TCGAACTTCG TGAGCGTGAA GTCGTTCCTC
CAGGAGGAGG CCGAGTACGA GCGGGTCCGC CGGCGTCGTC GCCAGGTCAT CGCCGCCGCG
TATCGCTCGT CCTTCTGGAG CATTGTGTTC TGGACCTCGA TGAGCCTCGT GGTCCGGTAC
CTCGTCTGGC CGGTGTCGAT CGTCCTCAAC CTGTATCTAT TCCTCCACGG TGAGATGAGC
CTGGCCCAGT TCACGACCTT TCTCTCCGCG CTGGTCCTCT TCTCCGACTT TATCTGGGGG
ACGGTGTGGG AGATTGCGCA GCTCAATCTC CGACTCGCCC GCATCGAGGA GGCCTACCAC
TACCTCTTCG CCGGTCGGAA CATCCTGTCA GCCGGTCCGG ATAATTCGGC CAGTCCGGAT
AATTCAGCCG GTTCAGCTGG TTCAGGCCGT CCTGCGGTTC TTCCCGCGGT TTCCGGTCGG
CTGCAGTTCC AGGGCCTGTC GTTCGCCTAT CCCGAGAACG ACCGGAACCT CGTGCTCGCG
GGCATCGACC TGGCCATCGA GCGGGGCGAA AAGATCGGCA TCGTGGGTCG CAGCGGAAGC
GGCAAGACGA CGTTCATCAA GCTGTTGCTA GGCTACTACC CGCTCCCACC GGGCATGGTC
GCCGTGGACG GTGTTCCGGT GTCCAACCGC CAGCTCGCTC GGGGTATCTC CTACGTTCCG
CAGGACACCA CCCTCTTCCA CCGGTCCATC CGGGACAACA TCGTCTACGG GACCGCGGGC
CCGGTGACCC AGGCGCAGGT GGAGGCGGCG GCGCTGCGGG CCCACGCGCA CAGCTTCATC
TCGCGGGCGC CCGACGGTTA CGACACCGTC GTCGGGGAAC GTGGCATCAA ACTCTCGACC
GGGCAACGGC AGCGGATCGC CATTGCCCGC GCCTTCCTGG ACGACAAACC CATCCTCATT
CTGGACGAGG CGACGAGCGC GCTGGACAGC GAGAGCGAGG TACTCGTCCA GAACGCACTC
GAGAACCTGT GGCGAGACAA GACGGTGATC GCCGTCGCGC ACCGGCTGTC GACCCTGCTG
CACATGGACC GCATCGTCGT CCTCGACGAC GGCCGCGTCG TGGAGCAGGG GTCCCATCGC
GAGCTGCTCG AGCGGCGGGG GCGCTACCAC CGGCTGTGGC AGCGGCAGAG CGGAGGAATG
ATCCCGGTCG AGTAG
 
Protein sequence
MDSVAGGVPH QEIRSPGIGS RGAGQNGTGL DDVGTAVDNL VSRRVERHAA LRSYLRFVAT 
FKGPALLVVG VFVLSNSLLA VIPVLIGLLV QALASAPVQT HDAYLYAGVL LACNIGHDLT
WRGAEVLYLR LLNHRGYEYE NILFGNVINE RYPYFVGKFT GKISSYVSTL GREFREILET
ACFTYVEQLV KVPAILMIMF SVNLYSGLTF LGSVAIMLLV GRHTVRRSAR AEKRLADEVS
EMDGYVIDVI SNFVSVKSFL QEEAEYERVR RRRRQVIAAA YRSSFWSIVF WTSMSLVVRY
LVWPVSIVLN LYLFLHGEMS LAQFTTFLSA LVLFSDFIWG TVWEIAQLNL RLARIEEAYH
YLFAGRNILS AGPDNSASPD NSAGSAGSGR PAVLPAVSGR LQFQGLSFAY PENDRNLVLA
GIDLAIERGE KIGIVGRSGS GKTTFIKLLL GYYPLPPGMV AVDGVPVSNR QLARGISYVP
QDTTLFHRSI RDNIVYGTAG PVTQAQVEAA ALRAHAHSFI SRAPDGYDTV VGERGIKLST
GQRQRIAIAR AFLDDKPILI LDEATSALDS ESEVLVQNAL ENLWRDKTVI AVAHRLSTLL
HMDRIVVLDD GRVVEQGSHR ELLERRGRYH RLWQRQSGGM IPVE