Gene Franean1_0184 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0184 
Symbol 
ID5668609 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp222282 
End bp224009 
Gene Length1728 bp 
Protein Length575 aa 
Translation table11 
GC content75% 
IMG OID641239113 
Producthypothetical protein 
Protein accessionYP_001504557 
Protein GI158312049 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00494452 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTACGA CTGTCCCACG TCGGCACCGC GCTCGCCGCC GCGACCGGTA CCTTCCGGGG 
ATCGTGAACA GCGCGGGAGT AGCAGGTCCG GCAGGACGGG CGCGCGGCCC GGCACGCTGG
TACGCGGCCC TGCGAGGACG GCCGCTGATC GTCCGGTGGG TTGTCAGCCG GGCCATGCTG
CTCACCCTGG CGCTGGCCGG CCAGGTGTTC GGCGCGCAGC AGAGCGTTCT CGGTGATGTC
GATCTCTACC GGGAGTGGGG CCACACGCTC GTCGGTGACG GGACGGTGCC CGGGGACGAG
AAATGGCAGT ACCCGCCGGG CGCGGCGGTC GTCCTGGCGC TGCCCGCGGT GCCGCGCGAG
CTGGCCGGCG TCCCGTATGA GGTGTCCTTC TACCTGCTGA TGCTCGTCGT GGACGCGGTG
CTCACCCGGG CGCTGGCCCG CCGCTCACCC GCGGCAGCCC GGTACTGGCT GCTGGCCACG
CTCGCCCTCG GCCCGGTGAT GCTGACCCGG TTCGACCTGG TGCCGGCGGC CGCGGCGCTG
GCCGCGGTCC TCGCCCTGGA CGGCTCCGCG CCCGGTGGCG CCCATGGCTC CGGTGACGCT
GAGCCTCCTG ATGGCGCTGG TCGGCCGGGG CACTCCCGGC GCCTCCGCCC GTTCAGTTCT
TTCGGTGCGC TTGGTTCACT CGGTCGTTTC GGTGGCTGGG TGGTGCTGGG CGTGGCGGTG
AAGGTGTGGC CCGGGTTGCT CCTGGTGGCC CTGGGACGAC GCGGGCTGAC GCGCCCGGGC
TCGCCGGTGC TGGGCCGGGT GGCGCGGATC GTGGCGGGCG CGGGTGTGAC GGCGGCCGTC
CTGGCCGCCA TCCTCGTGCT CGCCGGTTGG TGGCGGGGCG CGCTGGGGTT CCTGGACGCG
CAGAGCGCCC GCGGCCTGCA GATCGAGGCG GTGCCGGCCA CCCCGTTCGT CGTGGCGCGG
ATGCTCGGGA TCGGCTCCGC CCCGGAGTAC TCCTACGGGT CACTCCAGTT CGACGGCGGG
CTCGCCAGGG CAGTGGCGAC GGCCTGCTCG CTCGCCGAGG TGATCGTGAT CGCCGCCGCA
GTCCTGTGGT GGTGGGCGCG GCGGTCCCCT GGTCGGGCCG TGGACGCCTC CCCCCGGCCG
GGCGGGGACG GTTCGATGGA CACGGAGGGC ACATCGTCGG TTGCCGGGCG TGGCCTCGCG
CTGGTCCTGC TGATTGTGAT CACCTCGCGG GTGCTCAGCC CGCAGTACCT GGTCTGGCTG
CTCGTCCTCG CCGCCGCCGT CCGCCCGTCC ACCGCCGGCG CCGAGCGTGT GCGGGCCGAG
TCATCGGACC ACTCCGGGAG CACAGGCGGC CGGCTCGGGT GGTCCGGGTG GCGTGGGCGG
AAGGTCGACG CGGCGGGGCT GCTCGCCGTC TGCGCTGTGC TGTCGCAGGT TGTCTACCCG
TGGCGCTACA ACGACGTCGT GCAGGGGCGG GTCGTCGGCG GGCTGCTGCT GGTCGCGCGC
AACGCCGTGC TCGTCACCGC GGCCTGGTAC GCGCTTCGGG CCGCGGCGCG GGAATCGTCC
GGCGACGGAC AGTCGTCCGG CGACGGACCG CCGGCACAGT CGCCGCCGAC ACCACTGCCG
CCAACACCAC CACCAGGGCA GCGCCGAGCA GCGGGTCGGC CCGGCCGAGC AGTCGCCAGG
TGGCGGCGAG CGGCGACAGC GTCGCGAGGC ACAGCCAAAC CGACGTGA
 
Protein sequence
MATTVPRRHR ARRRDRYLPG IVNSAGVAGP AGRARGPARW YAALRGRPLI VRWVVSRAML 
LTLALAGQVF GAQQSVLGDV DLYREWGHTL VGDGTVPGDE KWQYPPGAAV VLALPAVPRE
LAGVPYEVSF YLLMLVVDAV LTRALARRSP AAARYWLLAT LALGPVMLTR FDLVPAAAAL
AAVLALDGSA PGGAHGSGDA EPPDGAGRPG HSRRLRPFSS FGALGSLGRF GGWVVLGVAV
KVWPGLLLVA LGRRGLTRPG SPVLGRVARI VAGAGVTAAV LAAILVLAGW WRGALGFLDA
QSARGLQIEA VPATPFVVAR MLGIGSAPEY SYGSLQFDGG LARAVATACS LAEVIVIAAA
VLWWWARRSP GRAVDASPRP GGDGSMDTEG TSSVAGRGLA LVLLIVITSR VLSPQYLVWL
LVLAAAVRPS TAGAERVRAE SSDHSGSTGG RLGWSGWRGR KVDAAGLLAV CAVLSQVVYP
WRYNDVVQGR VVGGLLLVAR NAVLVTAAWY ALRAAARESS GDGQSSGDGP PAQSPPTPLP
PTPPPGQRRA AGRPGRAVAR WRRAATASRG TAKPT