Gene Francci3_4531 details

Gene Information

Locus tag: Francci3_4531
Symbol:
ID: 3907508
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5408668
End bp: 5410176
Gene Length: 1509 bp
Protein Length: 502 aa
Translation table: 11
GC content: 71%
IMG OID: 637881864
Product: metal dependent phosphohydrolase
Protein accession: YP_483606
Protein GI: 86743206
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0617] tRNA nucleotidyltransferase/poly(A) polymerase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG
            [TIGR02692] tRNA adenylyltransferase
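
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the reported protein length excludes the stop codon; neither convention is stated in the record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming a complete CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(5408668, 5410176)
print(length, protein_length(length))  # 1509 502
```

Under these assumptions the computed values agree with the Gene Length (1509 bp) and Protein Length (502 aa) fields.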


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGATCAGTC TGGACAATCC GCGAGACTCG GCCGAACGGG TGGTGAGTGA GCTGTTGAGG 
GTGCCGCCGG CCGCCGATGA GCTTGGCCGC GTCTTCACCG CGAACGGCTA CCTCCTTCAC
CTCGTCGGTG GTTCGGTACG GGACGCCCTG CTCGGCAGGC CGGCATCCGC TGTGTCCGGG
GCCGTACCGG CCGATCTCGA CTTTGCCACG GACGCCCGCC CCGAGCGGGT TCTGGAGATC
ACCCGAGGAT GGGCGGAGGC GACCTGGGAG GCCGGAATCG CCTTCGGCAC GGTCGGGCTC
GCTCGGCACG GTGTCCGGTT CGAGATCACT ACCTACCGTA GCGAGGCGTA CGACCGGAAG
TCCCGCAATC CGGCCGTGAC CTACGGCGAC AGCCTGGAGG CGGACCTGTC CCGGCGGGAC
TTCACCGTGA ATGCGATGGC GGTGTCCGTG CCCGGCCATG ACTTCGTCGA CCTGTTCGGC
GGGATGGCAG ATCTCGCCCG CGGCGTGCTG CGTACCCCCG CAAGCCCCGA GGCGTCGTTC
GACGACGACC CGCTGCGCAT CCTGCGGGCC GCCAGGTTCG AGGCGGCCCT GGGCCTCACC
CCGGTACCCG AGCTGGTCGC GGCGATGCGC TCGCGGGCCG ATCGGCTCGC GGTCGTCTCC
CCCGAGCGGA TCCGCGACGA ACTGCGCAAG CTCATGTCGG CTCCCGACCC GGTCGCCGGT
CTGGAGCGGT TGGTCGAGGT CGGGATCGCC GACATCGTGC TGCCCGAGGT GTCCGCCATG
CGGATGGAGA TCGACGAGCA CCACCAGCAC AAGGACGTCT ACGCCCACAC CATGACGGTG
CTGCGTCAGG CGATGACCCT GGAGGAGCCG GGGGAGCCGG ACGAGGTGCT GCGCTGGGCC
GCGCTGCTGC ATGACATCGG CAAGCCGCGG ACCCGCCGGC ACATGACGGG TGGGCGGGTG
TCCTTCCACC ACCATGAGGT CGTCGGCCGG GACATGGCCC GCCGCCGGCT CGCGGCGCTG
CACTTCCCCA AGGACGTGAC TGACGCGGTC TGCCGGCTTG TCTACCTGCA CCTGCGCTTC
CACGGATACG GGGCGGGCGA GTGGACCGAC GCCGCCGTAC GCCGCTACGT GCACGACGCG
GGTCCTCAGC TCTCCCGGCT GCACAAGCTG GTCCGCTCGG ACTGTACGAC CCGCAACCGG
CGCAAGGCCG CGGCGTTGTC CCGGACGTAC GACTCGCTCG AGGAGCGGAT CGCCGAGCTC
GCCGCGCGCG AGGAGATCGC CGCGATCCGG CCGGAGCTTT CCGGGGACGA CATCATGGTG
CTGCTGGGGC TGCCACCGTC CCGGTTGGTG GGCCAGGCCC GGTCGCACAT GCTGGAGTTC
CGCTTCGAGC ACGGGATCGT CGGCCGGGAG GCCGCCGAGG CCGAACTGTT CCGGTGGGCC
CGGGAGCACG AGGTGCCCGT CCCCGGTGAT GCGCCCGTCC CCGGTGAGCT CACGCCTCGG
CGGGGGTGA
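
A GC content figure such as the one reported for this gene can be recomputed from the sequence itself. This is a minimal sketch, assuming GC content is defined as the percentage of G and C among the unambiguous bases; the record does not say how its value was derived or rounded, and `gc_content` is a hypothetical helper name.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among A/C/G/T characters.

    Whitespace (such as the 10-base grouping used in the listing) and any
    other characters are ignored.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Example on the first two 10-base groups of the gene sequence.
print(gc_content("GTGATCAGTC TGGACAATCC"))
```

Passing the full gene sequence, with or without the spaces and line breaks, gives the value for the whole gene.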
 
Protein sequence
MISLDNPRDS AERVVSELLR VPPAADELGR VFTANGYLLH LVGGSVRDAL LGRPASAVSG 
AVPADLDFAT DARPERVLEI TRGWAEATWE AGIAFGTVGL ARHGVRFEIT TYRSEAYDRK
SRNPAVTYGD SLEADLSRRD FTVNAMAVSV PGHDFVDLFG GMADLARGVL RTPASPEASF
DDDPLRILRA ARFEAALGLT PVPELVAAMR SRADRLAVVS PERIRDELRK LMSAPDPVAG
LERLVEVGIA DIVLPEVSAM RMEIDEHHQH KDVYAHTMTV LRQAMTLEEP GEPDEVLRWA
ALLHDIGKPR TRRHMTGGRV SFHHHEVVGR DMARRRLAAL HFPKDVTDAV CRLVYLHLRF
HGYGAGEWTD AAVRRYVHDA GPQLSRLHKL VRSDCTTRNR RKAAALSRTY DSLEERIAEL
AAREEIAAIR PELSGDDIMV LLGLPPSRLV GQARSHMLEF RFEHGIVGRE AAEAELFRWA
REHEVPVPGD APVPGELTPR RG
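
The protein sequence corresponds to the gene sequence read codon by codon under translation table 11. The sketch below assumes that table 11 uses the same amino-acid assignments as the standard genetic code and that an alternative start codon such as GTG is read as methionine in the first position. The `translate` function and its `start_as_met` parameter are hypothetical, and forcing the first residue to M without checking that the codon is a valid start is a simplification.

```python
# Compact codon table: codons ordered by base T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str, start_as_met: bool = True) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace in the input is ignored. If start_as_met is True, the first
    residue is reported as M regardless of the first codon (a simplification).
    """
    seq = "".join(dna.upper().split())
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    if protein and start_as_met:
        protein[0] = "M"
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("GTGATCAGTC TGGACAATCC GCGAGACTCG"))  # MISLDNPRDS
```

The output matches the first ten residues of the protein sequence. Translating the final nine bases, `CGGGGGTGA`, with `start_as_met=False` yields `RG`, matching the end of the protein.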