Gene Francci3_0785 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0785 
Symbol 
ID3905721 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp912238 
End bp915720 
Gene Length3483 bp 
Protein Length1160 aa 
Translation table11 
GC content66% 
IMG OID637878118 
Productaminoglycoside phosphotransferase 
Protein accessionYP_479898 
Protein GI86739498 
COG category[R] General function prediction only 
COG ID[COG3173] Predicted aminoglycoside phosphotransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGCCTG CGCCACGGTT TCATCGGGTC GCCGCACTGC AGCGGCGTTT CGTGGACCGC 
GAACCGGTGC TGGCGGCTTT CGCCGAAGAG CTGACCCATC TAGGTGACGG TCCACGGGTC
TTCAACGCCA TCGGCGTGGG CGGTATAGGT AAGTCACGAC TACTGCGCGA GTTGAAGGAC
CGGGCGGCAA CCCGGTATCG AACGGCATCA CTCGATCTGC AGGTGCACTC GTTGCGCCAG
CAGGAAGACG CGCTGGCGGT GCTACGCAAC GAACTGGGGT CGCAAGGGGT ACACTTCGAT
CGCTTCGACA TCGCCTACGC AGTGCTGTGG CAGCGGCTAC ACCCACATCT GCGGTTGAAC
CGTTCGGAAC TGGCATTCGT CGACGAGAGT TCCATTCTCG CGGATATCGC CGACACGATG
ACCGGAATTC CGATTTTCGG TACGGCCCGC GGCCTTATCA AGCTGTTGGA GCGTGGCGGC
TGCGACATAC GCCGCCGCCT GCGGGTGCGG CGCGATCCCA CCCTGGCGAC ACTCGACGAC
CTGCCGAACA GTGAGCTCGC CGACGCCGTC ACGTTTCTGT TCGCTGAAGA CCTTCGGGCT
GCCAGCCAGG ACAGACAGTA TGTGATCATC CTTGACAGCC ACGAGGCACT GGTGCCGAGC
CCGGTACGCA CCGGACGGGC GCAACTCGCG GATGCCTGGC TACGTGATCT CGTGGCACAG
CTCGACCGCG GCCTGGTGGT CATCGCCAGC CGGGAACCGT TGCGGTGGGA ACTGACGGAT
CCCGACTGGG ACGGCCTCAT CGAGATATGC GACATCGGCG GCCTGCCCAT GGAAGCCCGG
CTGGAACTCC TCGACACCGG TGGGATCACC GATCCGTTCC AGCGCCAGAC AATTGCGGAC
GCCAGCGCAG GTCTCCCATT CTATCTGAAC CTGGCGGTGG ACACGCACCT GCAGACCGGT
GGGCGGGTCA GCGGTGCCCT CGTGTCCCAG CAGGAGATCT TGGCGCGCTT TCTTCAGCAT
GTCGCCCCGG AAGAGATTCG ATCACTGGAG GTCCTGAGCC CGGCGCGAAT CTTCGACTAT
GATATTTTTC GGATGCTGAC GGCCACGTTC CATCTCCCGG GACATCGCAT CGCCTGGGAA
TCCCTCACCG CATATTCATT CGTATACCCG GCGAACACCG CGCTGCGGTT TCATCAGCTC
ATGGCCGCAG CCCTGCGGGA ACGGCTTTCT CCGGGCACTA CCACCGACAT CCACGCCCTG
CTACAGGGAC TGTGGGAGAA CCGGGCCGAC CAGGCGACCG GCCAGGACAG CGGCGCTCAT
GCCGCCCGCG CCCTGCGGGA GGCCGCGTAT CACGCCCTGC GGGGTCGGCA GATCAGCGGC
GAGAGCATCC TCGGCTATGC CGACCACGCG GTTCGCCGTG GGGGACACAG CGCAGCCCAA
GGCATCACCG AGGACCTGCG CGCCCGGCTC AACGACCAAC CCGACCACGA CAACCTGCCC
GAAGCGCTGC GGTGCCTGCA GGCTGAGGCG GCGATTCGCC TCGGCGACGC GGCCACGGTC
ACGGCGCTGG TTCCCGAACC GGTCACCGCT CTGGCCATCG ACACAATGGT CGGCGCGCGG
CTCGCCGTGG CCGCCGGACA TGGTCGACGC ATCGCCGGTG AGACCCGGGC CGCGCTCGAC
GCCTACACCC AGGTCTGGGA CCACGCCACG GGGGCATCCC GGCTGACCGC GGGCCTGTGG
GCCGCCGACC TTCACATGGC CCAGGGCCGG TTCCGAGACG CGGAAGCACT CGCCACCGAC
CTGAGTGCGT CCGCTCCGGG GCAGGACACC GAGTTTCACG GAGACGTGGC CCGGATGCGT
CACCTCGCCC ACCGGTTCGC GTTCGAGTTC GACGCCGCCG CCCGCTATCT CGACGAGGCC
GCCGCCCACT ACGCCGCAGC CGATAGCGTT CTCGGGTTGG CCAACATCCA GACCAACCGG
GCCGAACTCC TCGCCTATAC CCACCCTGCC GAGGCCATCG TCGAGGCAGG CCAGGCGATC
GAGATCCAAC GCGAGATCGG CGCCCATCAC GAGCTCGGCA AGGCGTATAC CGCTCTCGCC
ATCGGGCAGC TCCGATGCGG TCAGCTCGAC GAGGCCGAGA CCTCCCTACG GTCCGCGTGC
GCCGCCCTCG ATGGGGTCGG GTATCGCTCC GGTCGAGCCC GCGCCGAGTT CTACCGCGCC
CTGGTCCATG CTCGCCGGGG TCAGCTCGAC GAAGCGGTTT CCTCGCTCCA GTGGAGCGTC
CGCGAGCTGG AAGCAGCCGA CGTCTACCCG ACAATCATCC TCTGCTCGGC GCACGCACTG
TCCATCCTCG GCATCGATGA CGGGCAGGTG ACCGCCGCCG CGAGCCGGGC ACGGGCAGGA
ATCCAGCCGT TCGGCACGCT TACCGATCTT GACGCCCGCG TCGAGGACTT CGTCGCCGAC
CTGCTCAACG GGCACACGTG GCGGCCCGAC GAACTGTACC GGCAGGCGGT CGGGCGCCCC
GATGCCGCGT CGGGTTTCTA CAATCACAAC ATCCGGATGG ACACGCCGGC CGGAGACGTG
ATTGTTCGTA TCCCGATTCC GGGCAGCGAC ATCATGGACC TGATGATCTG GCCGGAAGGG
GACGTGCTGC GCGCCATCCG CGGGACCGTG ACCCACGCGC CCCGTCTGCT CTACGCCCGC
ATGCAGCCCC GCTATCAGAT CGTGGAATTC CTTCCCGGTC AGCTCCTCGA CAACACCGCA
CCACGCGGCA CCCGTGTCCC CGACCACGTC ATCGGCGACG TCGTGGAACT CTTCGGGCAG
CTCGGCCTCG TCCCGCGCGA ACGCCTGCCG CAATCGCCGC CAGGCTGGCC GGCGGACGGG
CGGACCGCCG ACTTCGCCCG CCGCCTGTCC GCTGTCACCG CAGGCGTGCA TTCCCGGTTC
CTTCCCGACT TCGGCGATCT GTACGCCGAG TTCGGCATCC CCATCGACGC GCTCACCGAG
ATCAACAGTC GATGGACGAC GCTGCACCCG CGGCCTTTCC GCCTTCTCCA TACCGACATC
CACCGCAAGA ACATCATCAT CTCGGATGGC CAAGCCTACT TCCTCGACTG GGAACTCGCC
CTGTGGGGCG ATCCGGTCTA TGATCTTGCG GTCCACCTGC ACAAGATGAG CTACCAGCCC
GACGAGTCAG CCGCACTCGT CCATGGGTGG ACCTCCGTTG TCAACGGTCC GGCCACCGAG
GGATGGCAGT CAGACCTCGA CACCTACCTC TCCCACGAAC GGGTGAAGTC CGCCATCGTC
GACACCGTCC GCTACACAAA AATCATCACC GAAGGAAACC TCAGCCGCGA TGACACACAT
GCATTCATCG ACAAGCTCGT CAGGAAACTC GCCGCCGCGC ACGCCGTACT CAGCAACCGA
GTTACCATCG ACCACGAGGC CGCCGCAGCC CTCATACAGC AGTGGGCCGG CAGACGCCGA
TAG
 
Protein sequence
MAPAPRFHRV AALQRRFVDR EPVLAAFAEE LTHLGDGPRV FNAIGVGGIG KSRLLRELKD 
RAATRYRTAS LDLQVHSLRQ QEDALAVLRN ELGSQGVHFD RFDIAYAVLW QRLHPHLRLN
RSELAFVDES SILADIADTM TGIPIFGTAR GLIKLLERGG CDIRRRLRVR RDPTLATLDD
LPNSELADAV TFLFAEDLRA ASQDRQYVII LDSHEALVPS PVRTGRAQLA DAWLRDLVAQ
LDRGLVVIAS REPLRWELTD PDWDGLIEIC DIGGLPMEAR LELLDTGGIT DPFQRQTIAD
ASAGLPFYLN LAVDTHLQTG GRVSGALVSQ QEILARFLQH VAPEEIRSLE VLSPARIFDY
DIFRMLTATF HLPGHRIAWE SLTAYSFVYP ANTALRFHQL MAAALRERLS PGTTTDIHAL
LQGLWENRAD QATGQDSGAH AARALREAAY HALRGRQISG ESILGYADHA VRRGGHSAAQ
GITEDLRARL NDQPDHDNLP EALRCLQAEA AIRLGDAATV TALVPEPVTA LAIDTMVGAR
LAVAAGHGRR IAGETRAALD AYTQVWDHAT GASRLTAGLW AADLHMAQGR FRDAEALATD
LSASAPGQDT EFHGDVARMR HLAHRFAFEF DAAARYLDEA AAHYAAADSV LGLANIQTNR
AELLAYTHPA EAIVEAGQAI EIQREIGAHH ELGKAYTALA IGQLRCGQLD EAETSLRSAC
AALDGVGYRS GRARAEFYRA LVHARRGQLD EAVSSLQWSV RELEAADVYP TIILCSAHAL
SILGIDDGQV TAAASRARAG IQPFGTLTDL DARVEDFVAD LLNGHTWRPD ELYRQAVGRP
DAASGFYNHN IRMDTPAGDV IVRIPIPGSD IMDLMIWPEG DVLRAIRGTV THAPRLLYAR
MQPRYQIVEF LPGQLLDNTA PRGTRVPDHV IGDVVELFGQ LGLVPRERLP QSPPGWPADG
RTADFARRLS AVTAGVHSRF LPDFGDLYAE FGIPIDALTE INSRWTTLHP RPFRLLHTDI
HRKNIIISDG QAYFLDWELA LWGDPVYDLA VHLHKMSYQP DESAALVHGW TSVVNGPATE
GWQSDLDTYL SHERVKSAIV DTVRYTKIIT EGNLSRDDTH AFIDKLVRKL AAAHAVLSNR
VTIDHEAAAA LIQQWAGRRR