Gene Franean1_4179 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4179 
Symbol 
ID5672534 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4967846 
End bp4970233 
Gene Length2388 bp 
Protein Length795 aa 
Translation table11 
GC content67% 
IMG OID641243052 
ProductABC transporter related 
Protein accessionYP_001508469 
Protein GI158315961 
COG category[L] Replication, recombination and repair 
COG ID[COG0178] Excinuclease ATPase subunit 
TIGRFAM ID[TIGR00630] excinuclease ABC, A subunit 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0706824 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.286625 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAACG CCACGAGGAC GGACCCGCAG CCGCCCGCGC CCCACCTCGC CGACAGCCAC 
GACCTCATCC GCGTCCACGG CGCCCGCGTG AACAACCTCA AAGACCTCAG CGTCGAACTC
CCCAAACGCC GACTCACCGT GTTCACCGGC GTCTCCGGCT CAGGCAAAAG CTCCCTGGTC
TTCGGCACCA TCGCCGCGGA ATCCCAGCGA CTGATCAACG AAACCTACAG CGCCTTCGTC
CAAGGCTTCA TGCCCACCCT CGCACGACCC GAAGTCGACG TCCTCGACGG ACTCACCACC
GCGATCATCG TCGACCAGCA GCGGCTCGGC GCCGACCCCC GCTCCACCGT CGGCACCGCC
ACCGACGCCA ACGCCATGCT GCGCATCCTG TTCAGCCGGC TGGGACGTCC ACACATCGGC
TCACCCAACG CCTTCTCCTT CAACGTCCCC ACCGTCCGGG CAAGCGGCGC GATCACCACC
GAACGCGGAA CCAGCAAAAC CGAACGAAAG ACCTTCACCC GCACCGGCGG CATGTGCCCC
CGCTGCGAAG GCCGCGGCGC CGTCTCCGGC TTCGACCTCA CCGCCCTCTA CGACGACAGC
AAATCCCTCA ACGAAGGCGC CCTAACCATC CCCGGCTACA GCGTCGACGG CTGGTACGGC
CGCATCTTCG GCGGCTCCGG CTTCCTCGAC CCCGACAAAC CCATCCGCCG ATACACCAGG
ACCGAACTCC ACGACCTGCT CTACAAAGAA CCCACCAAAA TCAAGGTCGA CAACGTCAAC
CTCACCTACG AAGGCCTCAT CCCGAAAATC CAGAAATCGA TCCTGTCCAA AGACCGCGAA
GCGATGCAGC CACACATCCG CGCCTTCGTC GACCGGGCCG TCACCTTCAC CACCTGCCCC
GACTGCGACG GCACCCGGCT CAGCGAAGCC GCCCGCTCCT CCCGCATCGC CGGCACCAAC
ATCGCCGATG CCTGCGCCAT GCAGATCAGC GACCTCGCCC ACTGGGTCCG CGACCTCGAC
GAACCATCCG TCGCACCCCT GCTCACCGCG CTGCACCACA CCCTCGGCTC CTTCGTCGAG
ATCGGCCTGG GCTACCTCTC CCTCAACCGG CCCTCCGGCA CCCTCTCCGG CGGCGAGGCG
CAGCGCGTCA AAATGATCCG CCACCTCGGC TCCTCACTCA CCGACACCAC CTACGTCTTC
GACGAACCCA CCGTCGGCCT GCATCCCCAC GACATCCAGC GCATGAACAA CCTGCTGCTG
CGACTGCGAG ACAAGGGCAA CACAGTGCTC GTCGTCGAAC ACAAGCCGGA AACAATCGCC
ATCGCCGACC ACGTCGTCGA CCTCGGGCCC GGCGCCGGCA CCGCCGGCGG CACCGTCTGC
TACGAAGGCA CCCTCGCCGG GCTACGAACC AGCGGCACTC TCACCGGCCG CCACCTCGAC
GACCGCGCCA CCCTCAAACC GACCGTGCGC ACCCCCACCG GCCAGCTCCC GATCCGCGGC
GCGACCACCC ACAACCTGCA CGACGTCAAC GTCGACATCC CCCTCGGCGT ACTCGTCGTC
GTCACCGGCG TCGCCGGCTC CGGCAAAAGC TCCCTCATCC ACGGATCGAT CCCCGCCGGC
GCGGACGTCG TCTCGATCGA CCAGACCGCC ATCCGCGGCT CACGACGCAG CAACCCCGCC
ACCTACACCG GACTGCTCGA CCCGATCCGC AAGGCATTCG CGAAAGCCAA CGGTGTCAAG
CCCGCACTGT TCAGCGCCAA CTCCGAAGGC GCCTGCCCCG CCTGCAACGG CGCTGGCGTC
ATCTACACCG ACCTGGCGAT GATGGCCGGC GTCGCCAGCA CCTGCGAAGA ATGCGACGGC
AAACGGTTCG AAGCCTCCGT GCTCAACCAC CACCTCGGCG GCCGCGACAT CAGCGAAGTC
CTCGCCATGT CCGTCACCGA CGCCCAGGAG TTCTTCGGCA CCGGCGAGGC ACGCACACCC
GCCGCACACA CCATCCTCAA CCGGCTCGCC GACGTCGGAC TCGGCTACCT CACCATCGGC
CAGCCACTCA CCACCCTCTC CGGCGGCGAA CGGCAACGAC TCAAACTCGC CACCCACATG
GCCGACAGGG GCGCCACCTA CATCCTCGAC GAACCCACCA CCGGCCTGCA CCTCGCCGAC
GTCGAACAAC TCCTCGGCCT ACTCGACCGG CTCGTCGACT CCGGCAAGTC CGTCATCGTC
ATCGAACACC ACCAGGCCGT CATGGCCCAC GCCGACTGGA TCATCGACCT CGGCCCCGGC
GCCGGTCACG ACGGCGGCCG GATCGTCTTC GAAGGCACAC CCGCCGACCT CGTCGCCGCC
CGTTCCACCC TCACCGGCGA ACACCTCGCC GCCTACATCG GCACCTGA
 
Protein sequence
MSNATRTDPQ PPAPHLADSH DLIRVHGARV NNLKDLSVEL PKRRLTVFTG VSGSGKSSLV 
FGTIAAESQR LINETYSAFV QGFMPTLARP EVDVLDGLTT AIIVDQQRLG ADPRSTVGTA
TDANAMLRIL FSRLGRPHIG SPNAFSFNVP TVRASGAITT ERGTSKTERK TFTRTGGMCP
RCEGRGAVSG FDLTALYDDS KSLNEGALTI PGYSVDGWYG RIFGGSGFLD PDKPIRRYTR
TELHDLLYKE PTKIKVDNVN LTYEGLIPKI QKSILSKDRE AMQPHIRAFV DRAVTFTTCP
DCDGTRLSEA ARSSRIAGTN IADACAMQIS DLAHWVRDLD EPSVAPLLTA LHHTLGSFVE
IGLGYLSLNR PSGTLSGGEA QRVKMIRHLG SSLTDTTYVF DEPTVGLHPH DIQRMNNLLL
RLRDKGNTVL VVEHKPETIA IADHVVDLGP GAGTAGGTVC YEGTLAGLRT SGTLTGRHLD
DRATLKPTVR TPTGQLPIRG ATTHNLHDVN VDIPLGVLVV VTGVAGSGKS SLIHGSIPAG
ADVVSIDQTA IRGSRRSNPA TYTGLLDPIR KAFAKANGVK PALFSANSEG ACPACNGAGV
IYTDLAMMAG VASTCEECDG KRFEASVLNH HLGGRDISEV LAMSVTDAQE FFGTGEARTP
AAHTILNRLA DVGLGYLTIG QPLTTLSGGE RQRLKLATHM ADRGATYILD EPTTGLHLAD
VEQLLGLLDR LVDSGKSVIV IEHHQAVMAH ADWIIDLGPG AGHDGGRIVF EGTPADLVAA
RSTLTGEHLA AYIGT