Gene Francci3_3930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3930 
Symbol 
ID3906889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4704013 
End bp4704867 
Gene Length855 bp 
Protein Length284 aa 
Translation table11 
GC content67% 
IMG OID637881257 
Productnucleotidyl transferase 
Protein accessionYP_483009 
Protein GI86742609 
COG category[J] Translation, ribosomal structure and biogenesis
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1208] Nucleoside-diphosphate-sugar pyrophosphorylase involved in lipopolysaccharide biosynthesis/translation initiation factor 2B, gamma/epsilon subunits (eIF-2Bgamma/eIF-2Bepsilon) 
TIGRFAM ID[TIGR02623] glucose-1-phosphate cytidylyltransferase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.24112 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACGG GAGACGCAAC AGCCTTGAGC ATCGAGATCG CCCAGACCAG CGAGATTCCC 
GACGTGGCGG ACATCCCCGT CGTGATCCTG TGCGGCGGGA TGGGAACCCG GCTGCGGGAA
GCCAGCGAGA AACTACCCAA GCCGCTGGTG GACATCGGCG GCAAGCCGGT GCTGTGGCAC
ATCATGAAGA CCTACGAGCA CTATGGCTTC CGTAAGTTCG TGCTCTGCCT CGGCTACAAG
AGCGATCTGA TCAAGAACTA CTTCCTCGCC TACCGTGCGC AGGTCGCCGA CTTCACCCTC
ACGCTCTCCG ACGACCACAC CCCCCAGTTC CACAACACCG TGGGCGACGA GGCGTGGGAG
GTGACCTTCG CCGAGACGGG CCTACTCACC GGAACCGGAG CCCGGCTGCG CCGGGTCGCC
CAGTACCTGA CCGGCCCGCG GTTCATGCTG ACCTACGGCG ACGGCGTGGG TGCCGTCGAT
GTCGGCGCGG TGCTCGCCGA CCACCTGGCG TCGGGGCGGA TCGGGACGGT CACCGGCGTC
CGGCCGTCGA GTCGCTACGG CGAGCTGACC ACGGACGGCA ACGCCGTCAC CCTCTTCGCC
GAGAAGCCGC CGCAGACCGG CTGGGTGAGC GGGGGATACT TCGTCTTCGA GCGCGAGTTC
ATCGACAAGT ACCTCGACGA CGACCCGGCG CTGCTGCTGG AGCGTCACCC GCTGCAGCAG
CTGGCCCGGG ACAGCGAGCT GACCCTGCAC ACTCACGACG GGTTCTGGAT GGGTATGGAC
ACGTTCCGCG ACTGGACCGA GCTGAACCAG CTCTGGGATT CCGGTGCCGC GCCCTGGCGT
GTCTGGGCCG GCTGA
 
Protein sequence
MSTGDATALS IEIAQTSEIP DVADIPVVIL CGGMGTRLRE ASEKLPKPLV DIGGKPVLWH 
IMKTYEHYGF RKFVLCLGYK SDLIKNYFLA YRAQVADFTL TLSDDHTPQF HNTVGDEAWE
VTFAETGLLT GTGARLRRVA QYLTGPRFML TYGDGVGAVD VGAVLADHLA SGRIGTVTGV
RPSSRYGELT TDGNAVTLFA EKPPQTGWVS GGYFVFEREF IDKYLDDDPA LLLERHPLQQ
LARDSELTLH THDGFWMGMD TFRDWTELNQ LWDSGAAPWR VWAG