Gene Franean1_5217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5217 
Symbol 
ID5673551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6262963 
End bp6264132 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content59% 
IMG OID641244071 
Productappr-1-p processing domain-containing protein 
Protein accessionYP_001509481 
Protein GI158316973 
COG category[R] General function prediction only 
COG ID[COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0611343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTTT CCGTGAACAC CAAGAAGTGC TTTGTCGTCA TGCCGTTCGG TGAGAAGGCG 
GGGCCAGACG GTTCGTTAAT CGATTTCGAC AACGTCTACC GGGACGTCAT CAGGGAGCCG
GTCGAGTCGC TGGGCTTCAA GGTGGTGAGG GCCGACGAGA TCGAGCGGCC CGGCTCGATC
CACAGCGACA TGTTCCGGCA CATCGCGATG GACGATCTGG CGATCGTGGA CATCACGACG
GGAAACCCCA ACGTCCTCTA CGAGCTTGGC GTTCGACATG CCCTGAGGCC GTCATTGACG
ATCATAATCA AGCGGCGTGG CACAAAAATT CCCTTCAATT TTGCGGGAGA GCGTGTCATC
GACTATCCGA GCGTAAGGGG CAGCTACGCG GACAGTCGAG AGGAGATTCG AAGGTACATT
GAGAACGGGC TGAAGAAAAG TGAGACAGAC AGTCCGATCT TCAATTTTCT GCAGGATGCC
AGGAAGGATT GGAAGCGGGA GCGGATTACT TCGCGAGATG AATACCGCTA TCGGACCGTG
AGCTCGCCGA AGAAGAAGAT CAGTGTGATC ACCGGTGACA TTCGTGACTG GCGTGGTATC
GATGTCTGGG TGAACTCGGA GAACACCAAC ATGCAGATGG CGCGGTTCTT CGACCGTTCG
CTGTCTGCGA TGATCCGATA TGAGGGCGCG GTCAAGGACG CGAGCGATGA AGTTGTCGAG
GACACGATCG CCGGCGAGCT GACCGCGCTC CTCGGAGGTC GGGAGACGGT GACCGCGGGT
GCGGTGTACG TCACCGGCTC GGGTGCTCTC GCCGCAACCC GTGGCGTGAA GAAAATCTTC
CACGCGGCGA GCACCCAGGG CGTTCCGGGG AGCGGATACC AGATGATTCA GAATGTCGAG
AGATGTGTGA CCGCATCGAT GCGGCGTATC GACGAGCAGT TCGCTGACGC AGGACTGAGG
AGCATCGTCT TCCCGATGAT GGGAACAGGG GAGGGTGGCG GCGACGTCTA CGCCACCGCA
CCGCGTCTGA TACAGACGGC CGTTGCCTAT CTGGCCTTCA ACCCAGACAG TGTTGTCGAG
AAGGTCTACT TCTCGGCTTG GAACCGCCGT GACCTCGAGG CCTGCCTGAA CGCCCTGACG
GATGCGGTCG AGGTGGAGCC CATCGGCTGA
 
Protein sequence
MSLSVNTKKC FVVMPFGEKA GPDGSLIDFD NVYRDVIREP VESLGFKVVR ADEIERPGSI 
HSDMFRHIAM DDLAIVDITT GNPNVLYELG VRHALRPSLT IIIKRRGTKI PFNFAGERVI
DYPSVRGSYA DSREEIRRYI ENGLKKSETD SPIFNFLQDA RKDWKRERIT SRDEYRYRTV
SSPKKKISVI TGDIRDWRGI DVWVNSENTN MQMARFFDRS LSAMIRYEGA VKDASDEVVE
DTIAGELTAL LGGRETVTAG AVYVTGSGAL AATRGVKKIF HAASTQGVPG SGYQMIQNVE
RCVTASMRRI DEQFADAGLR SIVFPMMGTG EGGGDVYATA PRLIQTAVAY LAFNPDSVVE
KVYFSAWNRR DLEACLNALT DAVEVEPIG