Gene Franean1_2594 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2594 
Symbol 
ID5670988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3073287 
End bp3074738 
Gene Length1452 bp 
Protein Length483 aa 
Translation table11 
GC content71% 
IMG OID641241510 
Producttype II secretion system protein E 
Protein accessionYP_001506930 
Protein GI158314422 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG4962] Flp pilus assembly protein, ATPase CpaF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGTCC CGTCCTGGCA CGACCCGCAC CGACCAGCTG CGTCCGGGAA CGGTGCGGGC 
TGGCCCGGTC CGCTCGTCCC CTGGGACGGC GCCGGCGGTG GGGGCCCGGC TGCGCCGACC
GATCTTCCCG GCCGTGCCTC CGGCTCAGCG GCCGACGCCC TGCGCCTTCG GCTCCGTGAC
GGACTGCGGG CCGCGCTCGC CCGTCGGCTG CGGGCGGACG AAGACGCGGG CTCCCCGCCG
CTGACCGCGC AGGCACGCGA GGCGTTCGCC CGCTCCGTGC TCGTGGACCT GACCGAGGCC
CACACCACCG CCGAGCTGGC CCGCGGAGCG GCGGTCCTGA CGCCCGAGGA CGAGCAGCGC
GTCATTCACG AGGTCCTCGC CGAGGTCCTC GGGCTCGGCG GCCTCGAACC GCTCCTGGCT
GATGCCAGCA TCGAGAACAT CAACATCAAC GGTGATCGGG TGTTCATCCG CCGGGCGGAT
GGCAGCCGGC AGCGGCTTCC GGCGATCACC GGCTCGGATG CTGAGCTGGT CGGGCTGATC
CGTGACCTGG CAGCACATGC CGGGGTGGAG GAGCGGCGTT GGGACCGCGG CGCCCCCATG
GTCAATTTTC ATCTGGCCGA CAAGAGCCGC GTGTTCGCGG TCATGGCCGT CACCCAACGG
CCTTCCGTCA GCATCCGGCG GCACCGGTTC CGCCACGTCA CCCTGTCCGC GCTGCGGGCC
AACGGCACGA TCGACTACGG GCTGGAGGGT CTGCTCGCGG CGCTGGTGGC GGCGCGGAAG
AACATCGTGG TCGCCGGGGG CACCGCGATC GGGAAGACTA CGATGCTGCT CGCCTTGGCC
GACCAGATCC CACCGTCGGA GCGGTTGGTG ACGGTGGAGG ACGTCTACGA GCTCGGGCTC
GACGCCGACG AGCGGGCTCA CCCGGATGTG GTCGCCATGC AGGTGAGGGA ACCCAACACC
GAAGGCGAAG GCGCGATTTC TGCCTCAGAC CTGGTCCGGG CGGCGTTGCG GATGTCCCCC
GACCGGGTGA TCGTCGGCGA GGTCCGCGGG CCCGAGGTCA TTCCGATGCT CAACGCCATG
AGCCAGGGCA ATGACGGATC GATGACCACC CTGCACTCCT CGACTTCCCG CGGGGTGTTC
AGCCGGCTGG CCTCCTACGC CGTACAGGGC CCGGAACGGC TGCCCGTCGA GGCGACGAAC
CTGCTGATCG CCAGCGCGAT CCATGTCGTT GTCCATCTGG CCGAGCCGCG CGGTGAACCG
GGCCGCCGCG TCGTCTCGTC GGTGCGAGAG GTGGTCGACG CCGACGGTGT GCAGATCGTG
ACAAACGAGT TGTACCGGCC GGGTCCCGAC CGCCGCGGCC TGCCGGCGGC ACCGCCGACC
GGGGAGCTGC TCGACGACCT GATCGACGTC GGTTTCGACC CGGACCTGCT TGCCCGGGGG
TGGTGGGGAT GA
 
Protein sequence
MTVPSWHDPH RPAASGNGAG WPGPLVPWDG AGGGGPAAPT DLPGRASGSA ADALRLRLRD 
GLRAALARRL RADEDAGSPP LTAQAREAFA RSVLVDLTEA HTTAELARGA AVLTPEDEQR
VIHEVLAEVL GLGGLEPLLA DASIENININ GDRVFIRRAD GSRQRLPAIT GSDAELVGLI
RDLAAHAGVE ERRWDRGAPM VNFHLADKSR VFAVMAVTQR PSVSIRRHRF RHVTLSALRA
NGTIDYGLEG LLAALVAARK NIVVAGGTAI GKTTMLLALA DQIPPSERLV TVEDVYELGL
DADERAHPDV VAMQVREPNT EGEGAISASD LVRAALRMSP DRVIVGEVRG PEVIPMLNAM
SQGNDGSMTT LHSSTSRGVF SRLASYAVQG PERLPVEATN LLIASAIHVV VHLAEPRGEP
GRRVVSSVRE VVDADGVQIV TNELYRPGPD RRGLPAAPPT GELLDDLIDV GFDPDLLARG
WWG