Gene Francci3_0420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0420 
Symbol 
ID3903244 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp498762 
End bp501119 
Gene Length2358 bp 
Protein Length785 aa 
Translation table11 
GC content73% 
IMG OID637877751 
ProductPII uridylyl-transferase 
Protein accessionYP_479536 
Protein GI86739136 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG2844] UTP:GlnB (protein PII) uridylyltransferase 
TIGRFAM ID[TIGR00277] uncharacterized domain HDIG
[TIGR01693] [Protein-PII] uridylyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.599891 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGGACC GCCTGGAGAT CCTCAAGGGC CGCTCCGAGC ACCTGCCCAC GGCGACCGGC 
CCACGGCTGC GCCGCATCCT CACCGACCTC GCCGACGATG CCCTAGTCGA CCTGTTCGGC
GATCCCGGAC CGGGAATTGC CCTGGTGGCC GTCGGCGGCT ACGGACGCGC GGAACCCGCG
TTCGGCAGCG ATCTCGACCT CGTCCTCGTC CACGACGGCA AACGGAACGA CATCTCCGCC
GTCGCCGACG CGGTCTGGTA TCCGCTGTGG GATTCGGGGG TGGGTCTCGA TCACAGCGTG
CGCACCGTCG ACGAGGCCAT CATGGTGGCC GACGGTAACC TCAAGGCCGC GCTCGGCCTG
CTCGACGCCC GCCACGTCGC GGGTGACGCG ACGCTGGCCA CGCAGCTTGC CCAGCGGGCT
CGCGCCCGCT GGCGGTCCCA TGCTCCGAAA CGACTACCCG AGCTCGCCGA GGCCGTCGCG
GAACGGGCCA CCCGACTCGG TGAGGTCGCG TTCCTGCTCG AACCCGACCT GAAGGAATCC
CGCGGCGGGC TGCGTGACGT CCATGCCCTG CACGCCCTCG CGGCGGCCTG GGTCGCCGAG
TCCCCCCCCG CGCGGGTGAT CGCCGCCGGT CAGGTGCTGC TCGATGCCCG CGGCGAGATC
CGCCGGCTGA GCCGGGGACG AGCCCACGAT CGGCTCCTGC TCCAGGACTC CGAGTCGCTC
GCCACCATGC TCGGCTACGC CGACGCGTTC GCGCTCGCGC ACGCGGTCGC CGACGCCGGA
CGCGCCATCT CCTGGACCTG GGACACCACC TGGTACCGGG TCGCGGCCGG CCTGCGGTCG
GTGGCGCGGT GGCGCGGTGG CGGCCGGCGG CCGGCCCGCC GCCCCCTCGG CGAAGGGGTG
GTCGACCAGG ACGGCGAGGT CCAGCTCGCC CGGGACGTCG ATCCGGCGAA GGACCCGGTC
CTGGTGCTGC GGGCCGCCGC CGCGGCCGCA CGAAGCGGGC TCCCGTTGGG CCGCTACGCC
GTCGACCGGC TGGCCCGGGA GGCCCCACCG ATGCCCGACC CCTGGCCTGA CGCCGCCCGT
GACGCGCTGG TCGCGCTGCT GGCCACCGGC GACGCGGCCG TGGGGGTGCT GGAATCCCTC
GACCAGGTGG GGCTGCTCGT CCGGCTCATC CCGGAGTGGG CGGCGGTGCG CAGCAGACCG
CAACGCAATC CCTACCACCG CTATACCGTC GACCGGCATC TGATGGAGGC CGCGGCGCGG
GCCGCCGCGC ACACCCGCGA GGTGGACCGG CCGGATCTGT TGCTGCTCGG CGCCCTCCTG
CACGATATCG GTAAGGGCTT CCCCGGCGAC CACACCGACG CGGGCATCGT GCAGATCGAA
CGGATCGGCC CGCGGCTGGG CCTGCCGCCC GAGGACGTCG ACGTGCTTGT CGCCATGGTC
CGCCACCACC TGCTTCTCGG CGAGGCCGCG ACCAGGCGGG ACATCGACGA TCCCGGGACA
ATCGCGAGCG TCGCCGCGTC CGCCGGCAGC GTGCGGATCC TCACCCTGCT GCACCGACTG
ACCGAGGCCG ACTCCCGGGC CACCGGTCCG ACCGCGTGGA ACGCCTGGAA GGCGCGGCTG
ATCGGTGACC TCGTCGAGCG GGCCCGAGCC GTCCTCGGCG GCGAGCGGGT CCCGTCTCCC
CCGAGCCTCG ACGATCGGCA GCGTGGCCTG CTCGCGCTGC CCGACGAGCA CGTCGTCCAG
GTCAATCCGC TGCCGGACGA GGGCATGTTC GAGATCGTGG TGGTCAACGC CGACCACGTC
GGGCTGCTGG CGATCATCGC TGGTGTCCTC GCGCTGAACC GATTGGACAT CCGGCGGGCC
TCGGCCCGGA GGAACGGCGG TCGGGCGCTG CTGCAGGCCG CCGTGGCGGC GCCGCACGAA
CGGGTACCGG AACCGGCGCG GTTGCTCGCC GACCTGCGGG CCGCGCTCGC CGGCACCCTC
GACGTCGCCG GTCGGCTCGC CGCCCGGGAA CGCGACTACG CCGGGGCCAG GCGGTGGACC
ACGCCCGGCG CCCCTCAGGT GATCTTCGAT GACGACCTGG GATCGATGAC CGTGATCGAG
GTGCGCGCCC CAGACCAGGC CGGCGTGCTG TACCGGATCG TACGGGCGCT GTCCGGGTTG
CGACTCGATG TCGCCACCGC CATCGTCGCG ACCCTCGGAC TAGACGTGGT GGACGCCTTC
TACGTGCGGG AGGCCGATGG TGGTCCGGTG GCGGACGGCG GACGGCGCCG TGAGATCGCG
GACGCGATCC TCGCGGCCCT GGGACCGATG GAACCCATAG ACCGCGCGGA ATCGGCGGGC
CGGACCACAC AGACGTGA
 
Protein sequence
MTDRLEILKG RSEHLPTATG PRLRRILTDL ADDALVDLFG DPGPGIALVA VGGYGRAEPA 
FGSDLDLVLV HDGKRNDISA VADAVWYPLW DSGVGLDHSV RTVDEAIMVA DGNLKAALGL
LDARHVAGDA TLATQLAQRA RARWRSHAPK RLPELAEAVA ERATRLGEVA FLLEPDLKES
RGGLRDVHAL HALAAAWVAE SPPARVIAAG QVLLDARGEI RRLSRGRAHD RLLLQDSESL
ATMLGYADAF ALAHAVADAG RAISWTWDTT WYRVAAGLRS VARWRGGGRR PARRPLGEGV
VDQDGEVQLA RDVDPAKDPV LVLRAAAAAA RSGLPLGRYA VDRLAREAPP MPDPWPDAAR
DALVALLATG DAAVGVLESL DQVGLLVRLI PEWAAVRSRP QRNPYHRYTV DRHLMEAAAR
AAAHTREVDR PDLLLLGALL HDIGKGFPGD HTDAGIVQIE RIGPRLGLPP EDVDVLVAMV
RHHLLLGEAA TRRDIDDPGT IASVAASAGS VRILTLLHRL TEADSRATGP TAWNAWKARL
IGDLVERARA VLGGERVPSP PSLDDRQRGL LALPDEHVVQ VNPLPDEGMF EIVVVNADHV
GLLAIIAGVL ALNRLDIRRA SARRNGGRAL LQAAVAAPHE RVPEPARLLA DLRAALAGTL
DVAGRLAARE RDYAGARRWT TPGAPQVIFD DDLGSMTVIE VRAPDQAGVL YRIVRALSGL
RLDVATAIVA TLGLDVVDAF YVREADGGPV ADGGRRREIA DAILAALGPM EPIDRAESAG
RTTQT