Gene Francci3_4370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4370 
Symbol 
ID3907344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5219476 
End bp5220786 
Gene Length1311 bp 
Protein Length436 aa 
Translation table11 
GC content74% 
IMG OID637881701 
Productphosphoribosylamine--glycine ligase 
Protein accessionYP_483445 
Protein GI86743045 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG0151] Phosphoribosylamine-glycine ligase 
TIGRFAM ID[TIGR00877] phosphoribosylamine--glycine ligase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.766514 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGATTC TTGTTGTCGG CTCCGGCGGC CGTGAACACG CGCTGTGCCG GGCGCTCGCC 
CGCGATCCCC GGGTCGGCTC GCTGGTCTGC GCGCCCGGAA ACGCCGGAAC CGCCGAGCTC
GCCGAGCCCC GCCCGCTCGA CGTCGCTGAC CCGGACGCCG TCGCCGACCT TGCTGAGGCG
GTCGGCGCGG ACCTCACCGT CATCGGTCCG GAGATGCCGT TGGTAACCGG GGCCGCCGAC
GAGATACGCG CCCGCGGTCT CGCGGTCTTC GGCCCGAGCG CGGCCGCGGC CCGGCTGGAG
GGCAGCAAGG CGTTCGCCAA GGAGGTCATG CGCGCCGCCG GGGTGCCCAC GGCGGCCTCT
CGCGACCACA CCGAGATCGA GCCCGCCCTG GCCGACCTGG ACGTCTTCGG GCCACCGTAC
GTGGTGAAGT ACGACGGGCT CGCCGCCGGC AAGGGCGTCA CGGTGACCGA GGATCGCGAC
GCGGCCGTCG CCGCGGTCCG CGCGAGCCTG CGCGGACCGG ACGACCGGGT TGTCCTCGAG
GAGTACCTCG ACGGGCCCGA GGTCTCCCTG TTCGCCGTGG TGACCGAGTC CGGCGCGGTC
CTCCCGATGC TGCCGGCGCA GGATCACAAA CGGGTCGGCG ACGGCGACAC CGGCCCGAAC
ACCGGGGGGA TGGGCGCGTA CACCCCGCTG CCCTGGGCCT CGCATGGCCT GCCGGCGAAG
ATCGTCACCA CGGTGATCCG GCCGACGGTG GCCGAGATGG CCCGGCGCGG CACCCCGTTC
ACCGGTCTGC TCTACGCCGG GCTCGCGCTG ACCACCCGCG GGCCGCGGGT GGTGGAGTTC
AACGTGCGCT TCGGTGACCC CGAGGTGCAG GCCATCCTCG CCCTGCTCAC GACGCCGTTG
ACGGACGTGC TCTCCGGCCG GCGGGCACCG GTGTGGCGCT CCGGCGCGGC GATCAGCGTC
GTCGTCGCCG CGCACGGCTA TCCGGCGGCC CCCAGGCTCG GCGACCCGAT CCGCGGGCTC
GCCGCCGCCG GCGCGCTGGC CGGCGTCGAC ATCCTGCACG CCGGTACCCG CAGGGAGCCG
GATGGTCGGG TCGTCTCGGC CGGCGGCCGT GTGCTGTCCG TCACGGCCAT CGGCTCCAAC
CTGGAGTCCG CGCGAGGATC GGCCTACGAG GCCGTCAGCC GCATCTCGCT GCCCGGCGCC
CATTACCGCA CCGACATCGG AGATCCGTCC CGGATGCGCC ACGCGGCCGA CGTCCGTCGA
ACGGACGTCG CGTCCGGCGA GGATCCCGCA GGACATAGAA AGAAGGAGTA G
 
Protein sequence
MKILVVGSGG REHALCRALA RDPRVGSLVC APGNAGTAEL AEPRPLDVAD PDAVADLAEA 
VGADLTVIGP EMPLVTGAAD EIRARGLAVF GPSAAAARLE GSKAFAKEVM RAAGVPTAAS
RDHTEIEPAL ADLDVFGPPY VVKYDGLAAG KGVTVTEDRD AAVAAVRASL RGPDDRVVLE
EYLDGPEVSL FAVVTESGAV LPMLPAQDHK RVGDGDTGPN TGGMGAYTPL PWASHGLPAK
IVTTVIRPTV AEMARRGTPF TGLLYAGLAL TTRGPRVVEF NVRFGDPEVQ AILALLTTPL
TDVLSGRRAP VWRSGAAISV VVAAHGYPAA PRLGDPIRGL AAAGALAGVD ILHAGTRREP
DGRVVSAGGR VLSVTAIGSN LESARGSAYE AVSRISLPGA HYRTDIGDPS RMRHAADVRR
TDVASGEDPA GHRKKE