Gene Francci3_2118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2118 
Symbol 
ID3905508 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2484319 
End bp2486136 
Gene Length1818 bp 
Protein Length605 aa 
Translation table11 
GC content73% 
IMG OID637879453 
Productoligopeptide/dipeptide ABC transporter, ATP-binding protein-like 
Protein accessionYP_481219 
Protein GI86740819 
COG category[R] General function prediction only 
COG ID[COG1123] ATPase components of various ABC-type transport systems, contain duplicated ATPase 
TIGRFAM ID[TIGR02323] phosphonate C-P lyase system protein PhnK 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.227418 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.486625 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCCGG ACGTGACCGC CGCCGAGCCG ACCCTGGTCG TCGAGCGGCT GGACGTGACG 
TTTCGGGACG GCGTCGCCGG CGTGCGTGCG GTGCGGGACG TGTCGATCGC CGTCCGTCCC
GGCGAGTGTC TCGCGGTCGT GGGCGAGTCC GGTGCCGGCA AGAGCGTGCT CGCCCGGACG
CTGATCGGAC TGGCCGGGCG CGGTGCGATG GTCCGTGCTG GCCGGCTGGA CCTGCAGGGC
GTGGATCTGA CCGCCTTGAC CGAACCGGGG TGGCGGGTGT TGCGAGGCCG CCGGATCGGG
TTGGTGCCCC AGGACGCGTT GGCCTCCCTG GATCCGCTGC GGACGGTGGG AGCGGAGGTG
GCCGAGCCCC TGCGCGTCCA CCGGATCGTG GCCCGTCGCG ACGTCCGCGA GCGGGCCGTC
GCGACGCTTG GGCAGGTGGG GGTGCCGGAG CCGGCCCGGC GCGCCGGGCA GTACCCGCAT
CAGCTCTCTG GCGGTCTGCG ACAGCGGGCC CTGATCGCCT CGACGGTCGC GGCCGGACCC
GACCTGCTGC TGGTCGACGA ACCCACGACC GCGTTGGATG CCGCCTCCCG GGAGCGGATC
GTCGAGGTGC TGCGCGGCCT GGTACGCGGC GGTGTCGCTC TGCTGTTGAT TAGTCACGAT
CTTGCCACGG TGGCCGCCGT CGCCGACCGG GTGGCGGTCA TGTACGAGGG GCGGATCATC
GAGCAGGGAC CGGCCGTCGA CGTGCTTGGC GGTCCACGGC ATCCGTACAC CTGCGCGTTG
CTCGCGGCGG CCCCCTCGCG GCACTCGCGG GGCACCGTGC TGTCCCCGGA CCTGCCGCGC
CGTCCACCGG CCGGGCCGGA CGGCTGCCCA TACGCGGTGC GCTGCCCACT CGCCGACCAC
TGGTGCCGCG AGGAACTGCC GCGCCCCGAT CACCCGGGTC TCGAACCCGG CGTCCTGTGC
TGGCGGCCCG GAACGGAAAC GGAGCGGGCC GGACCGCCAC GGGTCGTCGC CCCGGCCCGC
AGGGACACCG CCGAGGCTCT CGTCGAAGCC ACCGGCATCA CCAAACGTTT CCGCGATCCC
GACGGGGGGT GGCGGGACGC CGTCCGTGCC GTGACGTTCG AGCTTCGTGC CGCCGAGACA
CTCGGCGTCA TCGGTGGGTC CGGGTCCGGC AAGACCACGC TGGCCCGCAT CGTGCTCGGC
CTGCTCGAAC CGGACGAGGG TACCGTCCGG TTCGCCGGCG CGCCCTGGGT GGCGGCCGCG
GCGGCGACCG CCTCGCCCAG CCGGGTACGC GAACGTGACC GCAGGCCTCG GCGGCACCGG
ATACAGGCCG TCCACCAGGA CTGCCTGAGC TCATTCGACC CGCGTCACAC CGCCGAGCGG
ATCGTCGGCG ACGCCATGTC CGGTCCGGAT CGGGGCCGGG CGCGACGAGA TCGGATCGTC
GCACTCCTCG ACCAGGTAGG GCTGTCCGAA CAGGTGCTGC GACGCCACCC TCGCGAGCTG
TCCGGCGGGC AGCGTCAGCG GCTGGCGATC GCGCGTGCGC TCGCGCCGTC ACCCGAGGTC
CTCGTCTGCG ACGAACCGGT GTCCGCGCTC GACCTGTCGG TGCAGGCCCA GATCCTCGAC
CTGCTGGCGG GACTGCGCGA CGAACTCGGC TTGGCCCTGC TGTTCATCTC CCACGATATC
GCAGTGATCC GGCACGTCAG CGACCGCGTC CTGGTGATGA AGGACGGGCA GGTGGCCGAG
ATCGGCGGCG CGGAGCAGGT GCTCGAGCGC CCCGCGCACC CGTACACCCG GCATCTGCTG
GCCGCGGCCC GCACCTGA
 
Protein sequence
MSPDVTAAEP TLVVERLDVT FRDGVAGVRA VRDVSIAVRP GECLAVVGES GAGKSVLART 
LIGLAGRGAM VRAGRLDLQG VDLTALTEPG WRVLRGRRIG LVPQDALASL DPLRTVGAEV
AEPLRVHRIV ARRDVRERAV ATLGQVGVPE PARRAGQYPH QLSGGLRQRA LIASTVAAGP
DLLLVDEPTT ALDAASRERI VEVLRGLVRG GVALLLISHD LATVAAVADR VAVMYEGRII
EQGPAVDVLG GPRHPYTCAL LAAAPSRHSR GTVLSPDLPR RPPAGPDGCP YAVRCPLADH
WCREELPRPD HPGLEPGVLC WRPGTETERA GPPRVVAPAR RDTAEALVEA TGITKRFRDP
DGGWRDAVRA VTFELRAAET LGVIGGSGSG KTTLARIVLG LLEPDEGTVR FAGAPWVAAA
AATASPSRVR ERDRRPRRHR IQAVHQDCLS SFDPRHTAER IVGDAMSGPD RGRARRDRIV
ALLDQVGLSE QVLRRHPREL SGGQRQRLAI ARALAPSPEV LVCDEPVSAL DLSVQAQILD
LLAGLRDELG LALLFISHDI AVIRHVSDRV LVMKDGQVAE IGGAEQVLER PAHPYTRHLL
AAART