Gene Francci3_3523 details


Gene Information

Locus tag: Francci3_3523
Symbol:
ID: 3904462
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4207341
End bp: 4209281
Gene Length: 1941 bp
Protein Length: 646 aa
Translation table: 11
GC content: 71%
IMG OID: 637880845
Product: metal dependent phosphohydrolase
Protein accession: YP_482605
Protein GI: 86742205
COG category: [R] General function prediction only
COG ID: [COG1418] Predicted HD superfamily hydrolase
TIGRFAM ID: [TIGR00277] uncharacterized domain HDIG; [TIGR03319] conserved hypothetical protein YmdA/YtgF
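
The start, end, and length values in this record are mutually consistent, which can be checked with a short sketch. It assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon; neither is stated on the page. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(4207341, 4209281)
print(length, protein_length_aa(length))  # prints "1941 646"
```

Both results match the Gene Length and Protein Length fields of the record.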


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.555023
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.082287
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGGGCG TCCTCGTCGT CCTGTTGTCA CTCGTACTTG TTGTCCTGAG CGTTCTGATC 
CTCGCCGTGG CGCGGTTGGT CCGGGCGACC CGGGTCGACA AGGTACCCGA CCCCGCTCCG
GTGGTGCCCC GAACCCCGGC CGCCCGGGGC GTCGGGGACG TGACGGGTCC GGCGGACTTT
GACGAGGAGC CGACGGTCCG GGTCCTGCCC GCCCCTTGGG AGGGTTCCGG TGCTCCGGCC
ACGACGGACG CCGACTCACC CGCCGATCGG GACGGCACCG CCGCGCGGAC CGGGGATTCG
GCGGTGACCG CCGGTCGCTC CGACGGCGGC CTCCGCGCGG CTCATGGTGG TAGCGCGGAG
GAGGCGGCGC AGATTGTGGC CCGAGCCGAA CGGGAGGCGG CGGAACGGTT GGCGCGGGCT
GAGCGGGATG CCGCCGAGAT CCGGCGGCGC GGCGAGGAGG ATGTCGCCCT ACTGCGCGAG
CGGATGCTCG CCGAGGCGGC GGTCGAAACC TCGCGAGTCC AGGCCGCGGC GAGAGAGTCC
GTCCGCGCCG AGCAGGAGGC CGCCCGGACC GAGATCGCCG CGACTCGGGC GGCGTTCGAC
GGTGAGCAGC AGGCCTGGCG GACGGAACTG CAGAGCCGGG AGGTTGCGAT AGCCGCCCGG
GAACAGCGCG TCGAGGACCG GATGGCCAGC CTCGACGATC ATGGTCGCCG GCTGGCGGAC
CGCGACCGCG ACCTGCTCGA CCGGGAGAAC GACCTGACCC GTCGGACGGC CGAGGTGGCC
GACCTCGAAC GTGCCCGTCA TGCCGCGCTG GAGCAGGTGG CCGGGCTCAC CGCCGGGCAG
GCCAGGGGAG AGCTGATCGC CGTCATCGAG CAGGAGGCCC GGCGGGAGGC GGCGCTGACG
GTCCGCGAGA TCGAGGCCCG GGCCGAGGAG GAGGGTGAGG AACGCGCCCG CAGGATCGTG
ACCACCGCCA TCCAACGGGT CGCGTCCGAC CAGACCACCG AGTCCGTCGT GACGGTCCTG
CATCTTCCCG GCGATGAGAT GAAGGGCAGG ATCATCGGGC GGGAGGGGCG CAACATTCGG
GCTTTCGAGT CCGTCACCGG GGTCAACGTG CTCATCGACG ACACGCCCGA GGCGGTGCTG
CTGAGCTGCT TCGATCCGGT GCGTCGCGAG GTCGGTCGCA TCACGCTGGC GGCTCTGGTG
TCCGACGGCC GGATCCATCC GCACCGCATC GAGGAGGAGT ACGCCCGCGC CCAGCTCGAG
GTGGCCGAGC GGTGCGTGCG GGCGGGCGAG GACGCCCTGC TTGAGACCGG CATCTCCGAG
ATGCACCCCG AGCTGGTTAA CCTGCTGGGC CAGTTGCGTT ACCGAACCAG CTACGGCCAG
AACGTGCTCG CGCACCTGAT CGAAAGCGCC CACCTCGCCG GAATCATGGC CGCCGAGCTG
CGCATGCCGC TTCCACTCGC GAAACGAGCG GCTCTGCTGC ACGACCTCGG CAAGGCGCTC
ACCCACGAGA TCGAGGGCTC TCACGCGTTG ATCGGGGCGG ATGTGGCCCG TCGCTACGGT
GAGGACGAGC AGGTCGTGCA CGCGATCGAG GCCCATCACA ACGAGGTCGC ACCCCGCTCG
ATCTGCGCGG TGCTGACCCA GGCCGCCGAC CAGATCTCCG GTGGCCGGCC TGGCGCCCGC
CGCGACAGCC TGGAGTCGTA TGTGAAACGG CTCGAGCGCA TCGAGCAGAT CGCCGGTGAC
CGTCCGGGTG TCGACAAGGT GTTCGCCATG CAGGCCGGCC GGGAGGTGCG TGTCATGGTC
GTGCCCGAGG AGATCGACGA TCTCGCCGCC CATCTGCTCG CCCGGGACGT CGCCAGGCAG
ATCGAGGAGG AGCTCACCTA TCCGGGTCAG ATCCGGGTGA CCGTCGTGCG CGAGACCCGT
GCGGTGGGCA CCGCCCGCTG A
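
The GC content field can be recomputed from a sequence block laid out like the one above. The sketch below is one way to do it, assuming the only non-sequence characters are the spaces and line breaks used for layout; `gc_fraction` is an illustrative name, not part of any database tool.

```python
def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases, ignoring layout whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied with its 10-base grouping intact.
first_row = "ATGGAGGGCG TCCTCGTCGT CCTGTTGTCA CTCGTACTTG TTGTCCTGAG CGTTCTGATC"
print(f"GC fraction of first row: {gc_fraction(first_row):.0%}")
```

Passing the full gene sequence instead of one row gives the gene-wide figure to compare with the reported value.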
 
Protein sequence
MEGVLVVLLS LVLVVLSVLI LAVARLVRAT RVDKVPDPAP VVPRTPAARG VGDVTGPADF 
DEEPTVRVLP APWEGSGAPA TTDADSPADR DGTAARTGDS AVTAGRSDGG LRAAHGGSAE
EAAQIVARAE REAAERLARA ERDAAEIRRR GEEDVALLRE RMLAEAAVET SRVQAAARES
VRAEQEAART EIAATRAAFD GEQQAWRTEL QSREVAIAAR EQRVEDRMAS LDDHGRRLAD
RDRDLLDREN DLTRRTAEVA DLERARHAAL EQVAGLTAGQ ARGELIAVIE QEARREAALT
VREIEARAEE EGEERARRIV TTAIQRVASD QTTESVVTVL HLPGDEMKGR IIGREGRNIR
AFESVTGVNV LIDDTPEAVL LSCFDPVRRE VGRITLAALV SDGRIHPHRI EEEYARAQLE
VAERCVRAGE DALLETGISE MHPELVNLLG QLRYRTSYGQ NVLAHLIESA HLAGIMAAEL
RMPLPLAKRA ALLHDLGKAL THEIEGSHAL IGADVARRYG EDEQVVHAIE AHHNEVAPRS
ICAVLTQAAD QISGGRPGAR RDSLESYVKR LERIEQIAGD RPGVDKVFAM QAGREVRVMV
VPEEIDDLAA HLLARDVARQ IEEELTYPGQ IRVTVVRETR AVGTAR
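
The protein sequence can be reproduced from the gene sequence by translating it in frame. The sketch below builds the codon table in the NCBI ordering (bases T, C, A, G); the sense-codon assignments of translation table 11 are the same as those of the standard code, and the two differ only in the permitted start codons. This gene begins with ATG, so no special handling of the first codon is needed here. `translate` is an illustrative helper, not a database or library function.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in NCBI order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove layout whitespace
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and the first 20 residues of the protein.
first_row = "ATGGAGGGCG TCCTCGTCGT CCTGTTGTCA CTCGTACTTG TTGTCCTGAG CGTTCTGATC"
print(translate(first_row))  # prints "MEGVLVVLLSLVLVVLSVLI"
```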