Gene Franean1_5499 details


Gene Information

Locus tag: Franean1_5499
Symbol:
ID: 5673830
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6657931
End bp: 6659637
Gene Length: 1707 bp
Protein Length: 568 aa
Translation table: 11
GC content: 75%
IMG OID: 641244354
Product: hypothetical protein
Protein accession: YP_001509760
Protein GI: 158317252
COG category:
COG ID:
TIGRFAM ID:
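
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names gene_length and protein_length are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 6657931
END_BP = 6659637


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count, assuming a complete CDS whose final codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under those assumptions the computed values agree with the listed Gene Length (1707 bp) and Protein Length (568 aa).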


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.102476
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCAAG GTCCAGCCAC CGAGTCTCTT CCCCCGCAGG TGCGCGGGCG GCACCGTTCG 
GGCCGGCGGG ATTCCGCGCT GGCACCGTCC CAGCCGCTCG GGTCGGCGCA GGTAGCGGGC
CAGGGGCTGC AGGAGGAGCC GCCGCCCGGC ACGCGGGGAG TGCCCGGTCC GGCGGTGATT
CCCGGGCAGG CGACACAGCC GCGTCGGCAG CACCCGAGCC TCGCGGCCTT CCCCGGGCTC
GCCGGCCTCG CGCCCGAGCG GCTGCGCGGC CAGAGCGACC GGATCCGGGC GGCCATGCGG
GGCGCGATCC CCCTCGCCCT CGCCGGGCTG ATCGCCAACG CGGCCAACCT CGGGGTCACC
CTGGTCATCG CGCGGGCCAT GAGCACCCGC TCCTACGGCG CCGTCGCGCA GCTTTTCGCG
ATCTTCTTCG TCGTCTCGAT GCCGGGCAGC GCCCTGCTCG TCGGTGTCGT CCGGCGGATC
ACGAACTGGC AGCACACCGG CCAGGCCGAC CTGATCGACG AGTGGATCGG CCGGGTCCGC
CGGGCCGGCG TCATCATGGT CCTCGCCGTC GCCGTCCTGG CGATCATCGC GCGCGGGTTC
GTCGCCCGCG AGCTCTCGCT GCCCGGAGCC GGCGGGGTCG CCGAGATCAT CATCGCCGGA
GCGGCCTGGT GCCTGCTGTG CGTCGACCGC GGGCTGATGC AGTGCGGGCG GCTTTACCCG
TCGCTCGCCG CGAACCTGCT GGTGGACGCG GCGGTCAAGA GCGGCTCGAC GATCGCGCTG
GTCCTGGCCG GACTGGACGA GGCCGGCGCG GCCATCGCCG TGCTGCTCGG GGTACTCGCC
GCACTCGCGC ACACCCGTTA CAGCCTGCGC CGGCACCCGT CGGCGATCAT GCAGGCCGAC
CCGACCCGCC CCCAGGCCCA AGCGCCGCCG GCCCCCACCC AGCTCCCGGC CCCCACCCAG
CTCCCGGCAC CGCCGGCCCG CCAGCGCCGG CTCACCGGCG GCCGGTTGGG ATCGCATCCG
GCGGCGGGGC GTGGCGACGC CGACACCGGG GCGACGACGC TGCCGCTGCC GGTCGCCGGG
CCGGCGGCCA CCGCGGAGCC GCGGCGGCTC GCCATCGAGG TCGGCGCCGC GCTGGTGACA
CTGGCGTTCC TCGGTGTGCT GCAGAACATC GATGTGCTGC TGCAGGGCCG GCTCGCCCCG
GACGAGTCGG GCTCCTACGC GGCCGTCTCC GTGGCGGCGA AGGTGATCGT GCTGGCCGCG
ATCGTCCTGG CCGGGTTCCT CCTGCCCGAG GCCGCGGACC GCAACCATCT CGGGCAGCAT
GCGCTGCATC AGCTCGGTGC GACGCTGGCG ATACTCGCGG TGCCCGCGGT GGGGCTGCTC
ACCGTAGCGG CGATCGCACC CGACACACTG CTGTCGCTGG CTTTCGGGCC GCGTTTCACC
AGTGCCTCCG GCGCTCTCCT CCCGCTGGCC GGAGCTATGA CCTGTCTCGG AGCGACCGTG
TTGTTCTCCC ACTATCTGCT GGCTCTCGGC AAGCGTGCCG TGCTGGCCGT GCTCGCCGTC
GCGACCGGCA CCGCCGTCGC CCTCATGGCC TGGGCGCAGG GCTCCCCCGT GTCGACCGCG
CGGGCGAACT TCGGCTGCCA GGCCGTGCTC GCCGTCGTCA CCGGGCTGAT GGTGCTCGCC
GCCGCCCGCC GGACGGCCCG CGCGTGA
 
Protein sequence
MNQGPATESL PPQVRGRHRS GRRDSALAPS QPLGSAQVAG QGLQEEPPPG TRGVPGPAVI 
PGQATQPRRQ HPSLAAFPGL AGLAPERLRG QSDRIRAAMR GAIPLALAGL IANAANLGVT
LVIARAMSTR SYGAVAQLFA IFFVVSMPGS ALLVGVVRRI TNWQHTGQAD LIDEWIGRVR
RAGVIMVLAV AVLAIIARGF VARELSLPGA GGVAEIIIAG AAWCLLCVDR GLMQCGRLYP
SLAANLLVDA AVKSGSTIAL VLAGLDEAGA AIAVLLGVLA ALAHTRYSLR RHPSAIMQAD
PTRPQAQAPP APTQLPAPTQ LPAPPARQRR LTGGRLGSHP AAGRGDADTG ATTLPLPVAG
PAATAEPRRL AIEVGAALVT LAFLGVLQNI DVLLQGRLAP DESGSYAAVS VAAKVIVLAA
IVLAGFLLPE AADRNHLGQH ALHQLGATLA ILAVPAVGLL TVAAIAPDTL LSLAFGPRFT
SASGALLPLA GAMTCLGATV LFSHYLLALG KRAVLAVLAV ATGTAVALMA WAQGSPVSTA
RANFGCQAVL AVVTGLMVLA AARRTARA