Gene Franean1_6055 details


Gene Information

Locus tag: Franean1_6055
Symbol:
ID: 5674376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7367721
End bp: 7368614
Gene Length: 894 bp
Protein Length: 297 aa
Translation table: 11
GC content: 72%
IMG OID: 641244903
Product: uracil-DNA glycosylase superfamily protein
Protein accession: YP_001510305
Protein GI: 158317797
COG category: [L] Replication, recombination and repair
COG ID: [COG3663] G:T/U mismatch-specific DNA glycosylase
TIGRFAM ID: [TIGR00584] mismatch-specific thymine-DNA glycosylate (mug)
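
The coordinate and length fields in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record; the coordinate convention (1-based,
# inclusive) is an assumption, checked against the reported gene length.
START_BP = 7367721
END_BP = 7368614


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp for 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS ending in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 894
print(protein_length(length_bp))  # 297
```

Both results match the Gene Length (894 bp) and Protein Length (297 aa) reported in the record.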


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.245344
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCAGCA CCAGACCGCC CAGCCAGGGC ACCGCGCCGG ACCAGCCGTC CCCGGCCGAC 
CAGACACCAG AAGACACGCT GCCGCAGACA CTCACCCTGT CGACGGGCCC CACGCTGCCA
TCAGAAGCTG CGACGCCGGG AATGGAAGCC GCGCCGGCGC AAGAAGCCGC CCCCGCGCAG
GAAGCCACGG ACAGGCCCGT AGCCGCCCCG GCACCGGAGG CCGCCGACAC GCCTGTAGCC
GGGCCGGCGG CGCTGAGGTC ACCGAAGCCG CCGCGGCGGC CCCGACCGGA CCGCGCGGAG
CTTCTCGCGG CCTACGGGAA GACCGTTCCG GACCTCGTCG GTCCGGAGAC CCGGGTGCTG
CTGTGCGGTA TCAATCCGTC CCTGGAGTCC GGCGCCACCG GATTCCATTT CGGGACGCCC
AGCAATCGGC TCTGGCCGGT CCTGCACTTC GCCGGGTTCA CCGGGCGCCG GCTGCATCCG
TCCGAGACCG AGCACCTACG CGCCCGGGGC ATCGGAATCA CGAATCTGGT GCACCGCTCG
ACCGCTCGCG CCGATGAGAT CGCTGACGAC GAAATCAGGG CCGGTGTGCC GGTACTCATC
GAGCTTGTCG AACGGATCCG CCCGGAATGG GTCGCCTTTC TCGGGCTCGC CGCGTACCGC
ATCGGCTTCG GGCGGCGGAC GGCGAAGGTC GGTCGACAGC CGGAGCGCAT CGGTCCCGCC
GGGGTGTGGC TGCTACCGAA CCCCAGTGGG CTGAACGCGC ACTACCAGCT ACCCGACCTT
GTCCGGGTCT ACGGCGAACT GCGCGAGGCC GCCTTCGGGC CTGTCACGGC CACCACGCCG
GCAGCGGGCC CGACCGCGGG TCCCGGACTC AGGTCCGGGC ACGGCGGCGG CTGA
 
Protein sequence
MASTRPPSQG TAPDQPSPAD QTPEDTLPQT LTLSTGPTLP SEAATPGMEA APAQEAAPAQ 
EATDRPVAAP APEAADTPVA GPAALRSPKP PRRPRPDRAE LLAAYGKTVP DLVGPETRVL
LCGINPSLES GATGFHFGTP SNRLWPVLHF AGFTGRRLHP SETEHLRARG IGITNLVHRS
TARADEIADD EIRAGVPVLI ELVERIRPEW VAFLGLAAYR IGFGRRTAKV GRQPERIGPA
GVWLLPNPSG LNAHYQLPDL VRVYGELREA AFGPVTATTP AAGPTAGPGL RSGHGGG
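
The protein sequence follows from the gene sequence by codon-wise translation. As a rough sketch, the snippet below builds the standard codon table (translation table 11 uses the same amino acid assignments and differs only in its permitted start codons) and translates the first 30 nucleotides of the gene. It also includes a small helper for the GC fraction. The names `translate` and `gc_fraction` are hypothetical helpers written for illustration, not functions from an existing library.

```python
from itertools import product

# Standard codon table in NCBI order (bases T, C, A, G). The amino acid
# assignments of translation table 11 are identical to these.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(dna: str) -> str:
    """Remove the spaces and line breaks used for display; uppercase."""
    return "".join(dna.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 nucleotides of the gene sequence, as displayed in the record.
print(translate("ATGGCCAGCA CCAGACCGCC CAGCCAGGGC"))  # MASTRPPSQG
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole protein and the reported GC content to be checked in the same way.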