Gene Francci3_2621 details

Gene Information

Locus tag: Francci3_2621
Symbol:
ID: 3906527
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3094518
End bp: 3096998
Gene Length: 2481 bp
Protein Length: 826 aa
Translation table: 11
GC content: 70%
IMG OID: 637879946
Product: ABC transporter related
Protein accession: YP_481712
Protein GI: 86741312
COG category: [L] Replication, recombination and repair
COG ID: [COG0178] Excinuclease ATPase subunit
TIGRFAM ID: [TIGR00630] excinuclease ABC, A subunit
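
The length fields in this record can be cross-checked against the coordinates. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
START_BP = 3094518
END_BP = 3096998


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp, protein_length(length_bp))  # 2481 826
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed above.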


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.12429
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCACGG CCATGAGCAC GGCCATGAGC ACGGCCACGA GGACGGACCC GCGGTCGCCG 
GACCCGCGGT CGCCGGACCC GCGGTCGCCG GACCCGCGGT CGCCGGCGCC GCACGTTGCC
GACCGCAGCG ATGTGATCCG CGTGCACGGC GCGCGCGAGA ACAACCTCAG GGACGTCAGC
ATCGAGATCC CGAAGCGCCG GCTGACGGTG TTCACCGGCG TCTCCGGCTC GGGCAAGAGC
TCGCTGGTGT TCGGCACGAT CGCCGCCGAG TCGCAGCGGC TGATCAACGA GACGTACAGC
GCCTTCGTGC AGGGCTTCAT GCCGACGCTG GCGCGGCCCG AGGTCGACGT ACTCGAAGGG
CTGACAACCG CGATCATCGT CGACCAGCAG CGGATGGGAG CCGACGCCCG CTCCACGGTC
GGCACCGCCA CCGACGCGAA CGCGATGCTG CGCATCCTGT TCAGCCGGCT CGGCGAGCCA
CACATCGGCT CGCCCAACGC GTTCTCCTTC AACGTCCCCT CGGTCCGAGC GGCCGGCGCG
GTCACGGTCG AACGCGGTGC CGGTAAGACC AAGACCGTGA AACAGACCTA CACCCGCGCC
GGCGGCATGT GCCCGCGCTG CGAAGGCCGG GGCACGGTCT CCGAGATCGA CCTCACCCAG
CTTTTCGACG ACTCCAAGTC GCTCGCCGAG GGCGCGATCA CCATTCCCGG CTACAAGGTG
GACGGCTGGT GGACGGTGGG GATCTTCATC GAGTCGGGCT TCCTCGACCC GAACAAGCCG
ATCCGCCAGT ACACGAAGAA GGAACTCCGG GACTTCCTCT ACAAGGAACC GACCAAGGTG
AAGGTCAACG GTGTCAACCT CACCTACGAG GGGCTGGTCC CCAAGGTCCA GAAGTCGTTC
CTCGCCAAGG ATCCCGACGC CCTGCAGCCG CATATCCGGG CGTTCGTGGA CCGGGCGGTC
ACCTTCACCG CCTGTCCCGA CTGCGGCGGC ACCCGGCTCA GCGAGGCGGC TCGGTCCTCC
AGGATCAAAG GGATCAACAT CGCCGACGCC TGCGCGATGC AGATCAGTGA CCTGGCCGAA
TGGGTCCGCG GCCTCGACGA GCCGTCGGTC GCGCCGCTGC TCGCCACGCT GCGGCGGACC
CTCGACTCGT TCGTGGAGAT CGGGCTGGGC TACCTCTCGC TCGACCGGCC GTCGGGCACG
CTGTCGGGCG GCGAGGCGCA GCGGGTCAAG ATGATCCGCC ACCTCGGCTC CTCCCTCACC
GACGTCACCT ACGTCTTCGA CGAGCCCACC ACGGGCCTGC ACCCCCATGA CATCCGGCGG
ATGAACGACC TGCTGCTACG GCTGCGGGAC AAGGGCAACA CGGTGCTCGT CGTGGAGCAC
GAGCCGGAGA CGATCGCGAT CGCCGACCAC GTCGTCGACC TCGGCCCCGG CGCCGGAACA
GCGGGCGGCA CCGTCTGCTT CGAGGGCACC GTCGAGGGGC TGCGGGTCAG CGGCACCCGC
ACCGGCCGCC ATCTCGACGA CCGGGCCGCC CTCAAGGAGG CGGTGCGGAC GCCCACCGGC
CGGCTAGCGA TCCGCGGCGC GACGGCGCAC AACCTGCGCG GCGTCGACGT CGACATCCCG
CTCGGGGTGC TTGTCGTCGT CACCGGCGTC GCCGGCTCCG GCAAGAGCTC GCTCGTGCAC
GGGTCGATCC CCCATCTCTC GGGGCCAGCC GGCGCGGGTG TGGTGTCGAT CGACCAGGGC
GCGATCCGCG GCTCGCGACG GAGCAACCCG GCGACGTACA CCGGACTGCT CGACCCGATC
CGCAAGGCGT TCGCGAAGGC GGGCGGCGTG AAGCCGGCGC TGTTCAGCGC CAACTCCGAG
GGCGCCTGCC CCACCTGCAA CGGCGTCGGG GTCATCTACA CCGACCTGGC GATGATGGCC
GGCGTCGCCA CCACCTGCGA GGAGTGCGAG GGGAAACGGT TTGAAGCATC GGTGCTGGAG
CACCATCTCG GCGACCGCGA CATCAGTGAG GTGCTCGCGA TGTCGGTGAC CGAGGCCGAG
GAGTTCTTCG GCGCCGGGGA GGCGCGCACG CCGGCCGCGC ACGCCATCCT CAACCGGCTC
GCCGACGTCG GGCTCGGCTA CCTCAGCCTC GGCCAGCCGC TCACCACGCT GTCCGGCGGC
GAGCGGCAGC GGCTCAAGCT GGCCACCCAC CTGGCCGAGA AGGGCGGCGT CTACGTCCTC
GACGAGCCGA CCACCGGCCT GCACCTCGCC GACGTCGAGC AGCTGCTCGG CCTGCTCGAC
CGGCTCGTCG ACGCCGGCAA GTCGGTCATC GTCATCGAGC ACCACCAGGC GGTCATGGCG
CACGCCGACT GGATCATCGA CCTCGGTCCC GGCGCTGGCC ACGACGGCGG CCGGATCGTC
TTCGAGGGCA CCCCCGCCGA CCTCGTCGCA GCCCGCTCCA CCCTCACCGG CGAGCACCTC
GCGGCCTACG TCGGCACCTG A
 
Protein sequence
MSTAMSTAMS TATRTDPRSP DPRSPDPRSP DPRSPAPHVA DRSDVIRVHG ARENNLRDVS 
IEIPKRRLTV FTGVSGSGKS SLVFGTIAAE SQRLINETYS AFVQGFMPTL ARPEVDVLEG
LTTAIIVDQQ RMGADARSTV GTATDANAML RILFSRLGEP HIGSPNAFSF NVPSVRAAGA
VTVERGAGKT KTVKQTYTRA GGMCPRCEGR GTVSEIDLTQ LFDDSKSLAE GAITIPGYKV
DGWWTVGIFI ESGFLDPNKP IRQYTKKELR DFLYKEPTKV KVNGVNLTYE GLVPKVQKSF
LAKDPDALQP HIRAFVDRAV TFTACPDCGG TRLSEAARSS RIKGINIADA CAMQISDLAE
WVRGLDEPSV APLLATLRRT LDSFVEIGLG YLSLDRPSGT LSGGEAQRVK MIRHLGSSLT
DVTYVFDEPT TGLHPHDIRR MNDLLLRLRD KGNTVLVVEH EPETIAIADH VVDLGPGAGT
AGGTVCFEGT VEGLRVSGTR TGRHLDDRAA LKEAVRTPTG RLAIRGATAH NLRGVDVDIP
LGVLVVVTGV AGSGKSSLVH GSIPHLSGPA GAGVVSIDQG AIRGSRRSNP ATYTGLLDPI
RKAFAKAGGV KPALFSANSE GACPTCNGVG VIYTDLAMMA GVATTCEECE GKRFEASVLE
HHLGDRDISE VLAMSVTEAE EFFGAGEART PAAHAILNRL ADVGLGYLSL GQPLTTLSGG
ERQRLKLATH LAEKGGVYVL DEPTTGLHLA DVEQLLGLLD RLVDAGKSVI VIEHHQAVMA
HADWIIDLGP GAGHDGGRIV FEGTPADLVA ARSTLTGEHL AAYVGT
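
The gene sequence above is laid out in space-separated blocks of ten bases, so it needs to be normalised before any computation. The following is a minimal sketch, with hypothetical helper names, of how the blocks could be joined and a GC fraction computed for comparison with the GC content field. Only the first line of the gene sequence is used here as sample input; the full sequence would be passed in the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to lay the sequence out in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record, used as sample input.
first_line = "ATGAGCACGG CCATGAGCAC GGCCATGAGC ACGGCCACGA GGACGGACCC GCGGTCGCCG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

The same `clean_sequence` helper also works on the protein sequence, which uses the same ten-character block layout.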