Gene Franean1_7234 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7234 
Symbol 
ID5675535 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8832521 
End bp8833903 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content77% 
IMG OID641246071 
Producthypothetical protein 
Protein accessionYP_001511459 
Protein GI158318951 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.273739 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.522383 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACCCGC GGTCAGAGGC TTCCTGCTGT GCTGTCGCGA TGACGCAGGC GGTGCAGTGG 
GATCAGGGCG TCTCGGACGC GTGGTCACGG ATCGCGCAGA TCGGTCCCAA GATCCTGGTC
TTCGCGCTCG TCCTCACCGT CGGAGCCGTG GTCGTCCGCG GCGCGCTCCA CGCCGCGGAC
CGGGTGCTGG AGCAGGCCGG CCTCGACGGC GCGCTCGACC GGGCCGGCGC GTCGCGGCTG
CTGCGCTGCC AGACCGGGGT GATCAGGGGA TGGCTGCTGC GGGTGGTGGC CGTGATCTGC
CTGTTGGCCA TCCTGCGTGC GGCGCTCGGG GTGTTCGGGC CGAGTCCCGC CGACCGGATG
GCCGGCACAG CGCTGGCGCT GCTCGCCCGC GCGCTGCTCG CGGCCGTCAT CGCGCTGCTC
GGGCTGGCGC TGGCCGCCTG GGCGCGGCGC CTGGTCACCG AGTCGTTCGC CGGGCTGCGG
CACGGCAACG CGCTGAGCCG GGCCGTTGCC GGGTTCGCCG TCCTCGCCTT CGGCAAGGCG
GCGCTGGACG AGCTCGGCAT CGGGACGTCG GTCACCACAC CACTGCTCTA CGCAGTGCTC
GCGGCCTGCA CCGGAGTGGT GGTGGTCGGA GTGGGCGGCG GTCTCGTCCG GCCGATGCAG
AGCCGCTGGG AGAAGATCCT CGACCGGGCC GAGGACGGCG CCGGAGAGGC CCGCGCCGCC
TGGCACGCGA ACCGGGGCGC GGCGCGGTGG GCACCGCCGA GCAGCGCCGG CCAGAGCCCC
CCGGGACGGC CGTCCACACC GCCCGGCGGC ACGCCCCCAC CACCCGCGCC AGCCAGCCGT
GACACAGCCA CGCCGCCCTG CGGCACGCCC ATGGCACCCC TCGGTGAGCC GGGGCCGGAC
CGCGGCGTGC CCGTACCCGA CCCGGGTACG GCGGTGGCAC GCCGCACCGC GCCTCAGACG
CGCCCGCACC CGGGGCCAGC GACGGAACCG GCGTCCGTGC CCACGCCGAC TCCGGTGCCG
CCGCGGGCGC CGGCGCCCAC ACCGGCGGCG GCGCCGGCAC CGCGGCACCC GCCCGTCCCG
CTACCGGGCG AGCGGGGAAC GGCGTCGGAG CGGCGTCGGA CCCCGGCGCC GTCAGCGGTA
CCCGTGCCAC CGCCCCCGCC GCGATCGGCT CCGCCGAGCA CAGCACGGCC TGGAGCGTCC
TCCCCGCCGG GAGCGCTACC GCGGGCCCCC TCGGCGGCCG ACGTGCCGGG ACGGGTACCG
TCCACGGATC CCGCACCCGC GTCGCCACGC ACAACCCTGC CATCGCCGTC GGTGCTCCCC
GGCACGCCAC CGACATCCCC TGTTCGGGGG GAGGACCAGC TCCCCGGCAC TGTTTCCGAC
TGA
 
Protein sequence
MDPRSEASCC AVAMTQAVQW DQGVSDAWSR IAQIGPKILV FALVLTVGAV VVRGALHAAD 
RVLEQAGLDG ALDRAGASRL LRCQTGVIRG WLLRVVAVIC LLAILRAALG VFGPSPADRM
AGTALALLAR ALLAAVIALL GLALAAWARR LVTESFAGLR HGNALSRAVA GFAVLAFGKA
ALDELGIGTS VTTPLLYAVL AACTGVVVVG VGGGLVRPMQ SRWEKILDRA EDGAGEARAA
WHANRGAARW APPSSAGQSP PGRPSTPPGG TPPPPAPASR DTATPPCGTP MAPLGEPGPD
RGVPVPDPGT AVARRTAPQT RPHPGPATEP ASVPTPTPVP PRAPAPTPAA APAPRHPPVP
LPGERGTASE RRRTPAPSAV PVPPPPPRSA PPSTARPGAS SPPGALPRAP SAADVPGRVP
STDPAPASPR TTLPSPSVLP GTPPTSPVRG EDQLPGTVSD