Gene Franean1_4644 details


Gene Information

Locus tag: Franean1_4644
Symbol:
ID: 5672987
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5541797
End bp: 5543047
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 66%
IMG OID: 641243502
Product: pentapeptide repeat-containing protein
Protein accession: YP_001508918
Protein GI: 158316410
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
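
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the protein length excludes a single terminal stop codon. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(5541797, 5543047)
print(length, protein_length_aa(length))  # 1251 416
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields listed for this record.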


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCGACA GCGTAGTAGC CCGCCGGCGC ACCGGGCGCC GCTGGCTGTG GATCGTGGCC 
GGGGTTGCGG CCATCGGGGC GGTGTGTGCC GTGGTGGGGA TCTGGCATCT GCCAGACCGG
ATGTACCCGG GTACCTACGC CGAGGTGGGG GAGGCTCGCG CTGCGTTGCA GGCCGGGCTA
CTGACTGCTG CCGCCGCTCT GACCGCCGTA GCCGGTGGGC TGATTGCCTT GGACGAGACC
CGGCATGCCA ACGCCGAAGT GCGGCGGGCG AACGCGAACA CTCATGTCCG TGAGCTGTAC
GCGACCGCGA TTGGTCTTCT CAGCGCGGAT GACATCGATA GCCGCCTTGG TGGGATCTAC
GCGCTGGAAC GGATCGCTCG GGATAGCGCG GCTGACCATC GTATCGTCGT GGAGGTGCTC
TCGGCATTCC TGCGCGAGCA CACCCAGCCC GCTTCGGTGC TCGAGCAACG GCCACCTCCC
GGACGACGTT GGAGACATCC TCCGGTCGGA GCGGGTGGTG ACGACGAGGG CCGCGTCCGA
CTGCGGACGG ATATGCATGC CGCGTTCGCG GTCCTGGGGC GGCTCCCTGT CCGGCCCGGA
GCGCCCCCCG CTGACCTGAC AGGCCTTCAT CTGGGTGCGG CAGACCTGGC TGACGTTCAG
CTGACGGGCG CAGATCTCAC CGGCGCCCAG CTTGCTGGCG CAAATTTGAC CAATGCCTGG
CTAAGTGGAG CTAACCTCAC CCGAGCACAT CTTGACGGCG CAGTCTTGAC CGACGCCCGG
CTGGATCGGG CTGATCTCAC TCGGGCCCGG CTGGGAGGGG CGGACCTCAC TCGAGCCTGG
TTGCAGCATG CCAACCTCAC CCGAGCGCAG CTTGGCGGCG CTAATGTGAC CGACGCTCGC
CTGGTTGGCA CGGACCTTAC CGGAGCCCGA CTAGATGGTG CCAACCTCAC CCGCACCTGG
CTGGACGGTG CAAATCTCAC CGGCGCCCGA CTGGAAGGGG CGAAACTCGT CAACGCCTGG
TTGGAAAGGG CAAACCTCAT CGGTGCCCGG TTGATTGGAG CGGATCTTGA TGGGGCATGG
CTCAATGGAG TGGACCTTTT GGGTGCCTGG CTGAACGGAG CGGACCTCGC TCGCGTTGTG
GGATTGTCGC AGAGCCAGCT GGATGAGGCG CGGGGCAACG ACGAGACGCG GATACCAGAC
GGATTGGTAC GGCCAGAATC ATGGACGTCG GGGGACGGCA GTGGGGGATG A
 
Protein sequence
MADSVVARRR TGRRWLWIVA GVAAIGAVCA VVGIWHLPDR MYPGTYAEVG EARAALQAGL 
LTAAAALTAV AGGLIALDET RHANAEVRRA NANTHVRELY ATAIGLLSAD DIDSRLGGIY
ALERIARDSA ADHRIVVEVL SAFLREHTQP ASVLEQRPPP GRRWRHPPVG AGGDDEGRVR
LRTDMHAAFA VLGRLPVRPG APPADLTGLH LGAADLADVQ LTGADLTGAQ LAGANLTNAW
LSGANLTRAH LDGAVLTDAR LDRADLTRAR LGGADLTRAW LQHANLTRAQ LGGANVTDAR
LVGTDLTGAR LDGANLTRTW LDGANLTGAR LEGAKLVNAW LERANLIGAR LIGADLDGAW
LNGVDLLGAW LNGADLARVV GLSQSQLDEA RGNDETRIPD GLVRPESWTS GDGSGG
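
The sequences in this record are displayed in space-separated blocks of 10 characters, so they need to be joined before any analysis. The sketch below shows one way to strip the block formatting and compute a GC fraction. The helper names are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First displayed line of the gene sequence, copied as shown in the record.
first_line = "ATGGCCGACA GCGTAGTAGC CCGCCGGCGC ACCGGGCGCC GCTGGCTGTG GATCGTGGCC"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 2))
```

Applying the same helpers to the complete gene sequence gives a length and GC fraction that can be compared with the Gene Length and GC content fields of the record.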