Gene Franean1_6202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6202 
Symbol 
ID5675781 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7534481 
End bp7536919 
Gene Length2439 bp 
Protein Length812 aa 
Translation table11 
GC content75% 
IMG OID641245054 
ProductPII uridylyl-transferase 
Protein accessionYP_001510451 
Protein GI158317943 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2844] UTP:GlnB (protein PII) uridylyltransferase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR01693] [Protein-PII] uridylyltransferase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0708952 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAACGACA GGCTCCAGGC CCTGCGGGGC CGGGTGGACG CTTTTCCCGG TGTGACCGGG 
CCCGCGTTGC GGGAGGCGCT GGTCGAGCGC GCCGACGCGG CCCTCGGGGA GCTGTTCGGG
GAACCAGGGC CCGGAGTCGC GCTGCTCGCG GTCGGTGGCT ACGGGCGAAG CGAGCCCGCC
TTCGGCAGCG ACCTGGACCT TGTGCTCGTC CACGACGGCC GCCGAGGCGA CATCTCCGCC
GTCGCCGACG CGATCTGGTA CCCGCTGTGG GACGCCGGAG TCGGCCTGGA CCACAGCGTC
CGCACCGTGG ACGAAGCCGT CTCCGTCGCC GACTCCGACC TCAAGGCGGC GCTCGGCCTG
CTCGACGCCC GCGTGATCCG CGGCGACGGC GGCCTCGCGG GCACGCTGCT GGAACGGGTC
CGGACCCGGT GGCGGGAACG CGCACCGCGC CGCCTGCCCG AGCTCGCCGA GGCGGTCGCG
GAGCGCGCGC GCCGGATGGG CGAGCTCGCC TTCCTGCTCG AGCCCGACCT CAAGGAGGCC
CGGGGCGGCA TGCGGGACGT CCACGCGCTG CACGCCCTCG CCGCGGCCTG GGTCGCCGAG
GCGCCGCCGG AGCGCGTCCA GGCCGCCTAC CAGGTGCTGC TCGACGCCCG CGGTGAGGTG
CGCCGGCTCA GTCACGGCCG CGCGCAGGAC CGCCTGCTCC TGCAGGACGG CACGGCGGTG
GCCGCGGCAC TGGGCTACCC CGACTCCGTC GCGCTGTCCC GGGCCATCTC GGACGCCGGC
CGCACGATCT CCTGGACGTG GGACACGACC TGGTACCGGG TGGCCGCCGC GCAACGCTCG
AAGATCCGTT CCCGGCTGCG CCGGCCGGTC CGACGTCCCC TCGACGAGGG CGTCGTCGAA
CAGGACGGGG AGATCCAGCT GGCCCGCGAC GCCGAACCCG CCACCGACCC CGGGCTGGTG
CTCCGGGCGG CCGCGGCCGC GGCACGCGCC GGCGTCCCGT TCGGGCGCTA CGCCATCGAC
CGGCTCGGCC GCGAGGCGCC GGCGATGCCC GAGCCGTGGC CCGAGGCCGC CCGGGACGAC
CTGGTCAGCC TGCTGGCGAC CGGGGACGCC GCGGTGCGGG TGCTGGAGTC CCTCGACCAG
GTCGGGCTGC TCGTGCGGCT GATCCCCGAG TGGGCGGCCG TGCGCAGCAA GCCGCAGCGC
AACCCCTACC ACCGCTACAC CGTCGACCGG CACCTGATCG AGGCCGCCGC CCGCGCCGCC
GCCCACACCA GGGACGTCGA CCGGCCCGAC CTGCTGCTGC TCGGCGCGTT GCTGCACGAC
ATCGGCAAGG GATACCCGGG AGACCACACT GACGCCGGCA TCGTCATCGT CCAGACGTTG
GCGCCCCGGC TGGGCCTGCC GCCCGAGGAC ACCGAGGTGC TGGTCGCGAT GGTCCGCCAT
CACCTGCTGC TGACCGAGGC GGCGACCCGG CGGGACATCG ACGACGTCGC CACCATCGAG
TCGGTGGCGG CCGCAGCCGG CAGCGTGCGC GTACTCACCC TGCTGCACCG GCTTACCGAA
GCCGACGCGA AGGCCACCGG CCCGACCGCG TGGAACGCCT GGAAGGCCCG GCTCGTCACC
GACCTCGTCG ACCGGGCCAC CGCCGTGCTC GACGGCAAGG CACCGCCGTG CCCGCCGGCG
CTCACCGAAC GGCAGACCGC CCTGCTCGCG CTGCCCGACG ACCTCACCGT GCAGGTCAAC
CCGCTCCCCG ACGAGGGCAT GTTCGAGATC GTGGTCGTCA CCGCCGACCG GGTGGGCCTG
CTCGCGACCA CCGCCGGCGT CCTCGCGCTC AACCGGCTGG ACGTCCGCCG CGCCTCCGCG
CGGGGCGCCG GCGGCCGCTC GCTGCTACAG GCCGCCGTCG CGTCCGCGCA CGGGCACCGC
CCGGACCCGA AGCGGCTGCG CGACGACCTG CGCGCGGCCC TGGCGGGCAC CCTCGACGTC
ACCGCCCGCC TGACCGGCCG GGAACAGGAC TACGCCACCA CCCGTCGGTG GAACACGCCG
GGGGCACCTC AGGTGATTTT CGACGACTCC GGCTCGACGA CGGTCATCGA GGTGCGCGCC
CCCGACCGTG CCGGGGTGCT GCACCGGATC ACCGGCGCGC TGGCCGAGGC CGGCCTGGAC
GTCCGCACCG CGATCGTCGC GACCCTCGGC CTGGACGTCG TCGACGCCTT CTACGTCGAG
GACTCGTCGG CCCCGGTCAC AGCTGCCTCC ACGGCCAGCA CGGCCAGTTC TTCGGCCACC
ATCTCCACGT CGGCCACCTC TGCGTCTGTC GGGTCTGTGG CCGCGGCGTC CGCCGCCTCC
GTGACCGGCC TCGCTGCCCC GCGACTTCTG GGTAGCGCGC GGCGGGCCGA GGTGGCGACC
GCCGTCATGG CCGCGCTCGA CATAGGGGAG CCCGGCTAA
 
Protein sequence
MNDRLQALRG RVDAFPGVTG PALREALVER ADAALGELFG EPGPGVALLA VGGYGRSEPA 
FGSDLDLVLV HDGRRGDISA VADAIWYPLW DAGVGLDHSV RTVDEAVSVA DSDLKAALGL
LDARVIRGDG GLAGTLLERV RTRWRERAPR RLPELAEAVA ERARRMGELA FLLEPDLKEA
RGGMRDVHAL HALAAAWVAE APPERVQAAY QVLLDARGEV RRLSHGRAQD RLLLQDGTAV
AAALGYPDSV ALSRAISDAG RTISWTWDTT WYRVAAAQRS KIRSRLRRPV RRPLDEGVVE
QDGEIQLARD AEPATDPGLV LRAAAAAARA GVPFGRYAID RLGREAPAMP EPWPEAARDD
LVSLLATGDA AVRVLESLDQ VGLLVRLIPE WAAVRSKPQR NPYHRYTVDR HLIEAAARAA
AHTRDVDRPD LLLLGALLHD IGKGYPGDHT DAGIVIVQTL APRLGLPPED TEVLVAMVRH
HLLLTEAATR RDIDDVATIE SVAAAAGSVR VLTLLHRLTE ADAKATGPTA WNAWKARLVT
DLVDRATAVL DGKAPPCPPA LTERQTALLA LPDDLTVQVN PLPDEGMFEI VVVTADRVGL
LATTAGVLAL NRLDVRRASA RGAGGRSLLQ AAVASAHGHR PDPKRLRDDL RAALAGTLDV
TARLTGREQD YATTRRWNTP GAPQVIFDDS GSTTVIEVRA PDRAGVLHRI TGALAEAGLD
VRTAIVATLG LDVVDAFYVE DSSAPVTAAS TASTASSSAT ISTSATSASV GSVAAASAAS
VTGLAAPRLL GSARRAEVAT AVMAALDIGE PG