Gene Franean1_0503 details


Gene Information

Locus tag: Franean1_0503
Symbol:
ID: 5668922
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 583538
End bp: 585370
Gene Length: 1833 bp
Protein Length: 610 aa
Translation table: 11
GC content: 70%
IMG OID: 641239432
Product: hypothetical protein
Protein accession: YP_001504870
Protein GI: 158312362
COG category: [V] Defense mechanisms
COG ID: [COG0577] ABC-type antimicrobial peptide transport system, permease component
TIGRFAM ID:
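
The start, end, gene length and protein length fields above are consistent with each other. The sketch below shows the arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon (the gene sequence in this record ends in TGA). The function names are illustrative only and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(583538, 585370)
print(length, protein_length_aa(length))  # prints: 1833 610
```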


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGCGAC GGCACACAAC GGCCGGGGCC CGCAACGGTA TGCTGCCGAT ATTCGGCCTC 
GGCATCGCGG CCTGGTACGA CGTACGGACC AGGCCGTTGC GAACGCTGGC CGCGGTCGCT
GGGATGATCG CCGCGGTCGG CGCACTGATG GTGGTAGACG CGGCTGGGGT CCTTTCCCGA
CACGGCGCGG ACGACTATCT GCGCCGCCAG TACGGTCAAC CCGCGACTTT TCTGATCTCC
GGTACCGGCT CCTCCGATGC GCGACGGCAA ACCGAGGCCG ACACCCTGCT CACCCAGGGG
CTTGACCAGG CGGGATTCAG AACGATCAGC CCCTACCAGC TGGTTCGTGG GATCACGCTC
GACGGCGACC ATTCGGCGGC CGTCGACACG GCGGTCGTCG ACGGCTCCTA CCCGGATGTC
AGAGTCATCG ATCTCGTCGC GGGGGACTTC CCCACCCGGG CAGCTGACAG AGGTGTTCCC
CGCGCCGTCC TCGAGGAGGG GATCGCGCGC CAGCTCGGCT GGGAGCCTCA CCAGATCGTC
GGACGACAGC TCCAGTACGC GGTGACCGGA ACAGCCGCCA CCATCGACAC CCGGAACCCG
CGAACGATGA CGGTCGTCGT GGACGCGGTC ATGGCCTCCA CGGCACGCGG CGATTCCGTC
CCACGGCTCC TCCTTGTCGG TCGCCTGCCC CCACAGCTGC ACTCGGCGGC GGAAGACACC
CCAGGTAGAT GGCTGGCCCG GGTGGCACCT GCCGATACCG CCTACTTCCA GGAAGTCGTG
GCGAGCACGG GGCGACGGCT CGGATCGGGA AGCGGCTTCG AGGCCACCAG GCTGGATCAG
GACGAGGCTC TGGCACCCGT CATCGATCAG CAGACCCGGA CCGCCGGCAT CGTCGCCTGG
GTGGCGATGG CCGTCGGTGG CCTCGGCATG CTCGGTGTCG GCGTCGCCTC CGTCCGGGAA
CGTGCTCGTG AACTCGGCGT GCGCCGAGCG CTCGGCGCAT CCCGCGCAAC CATCTTCCTG
GGCGTTCTCG TCGAGACAAT GTTCAACGTG CTGGTAGCGG CCGCACTCGC CGTGCCACTG
GCCGCCCTGA TCGTCCACTT GCTGCCACTA CCTGTTCAAC ATGGGCACCG CCACATCGCG
CTTCGTCGTC GACCTGCGCG GATCGGGCAC CGCTCGGCTC CTGGACCTCT GGAACGGCAC
CACCCAGCCG GTCGCCAGCA CCGTCGACGG CGACCGTCAC ACCCTTGCCT TCACGCTCGG
TCCCGGCGGG ACGGCGGTCG TTCACCTCGG CACCGGGCCG GGTCTTGCCG CCCGCCCGGT
CCAGACCGAG GCCCCGAGCG AACCCGCGCA GGTCGTGTGG TCGCAGAACC TGAACCGCTG
GGACATCGAC GTGGCGGACT GGCGGCCCAC GGGGACGGTG CGGCACCGTC AGTCCATCAG
CGCGCCTGCC GACTGGCGGA CCATCCCGGC GCTCGCCGAT GCGGCCGGTG TCGGTACCTC
TACCGCACGA CGGTGGTTCT GCCAGCCGGT TGGGACACCG GCGTCGGCGC GGTGCTGCTG
GAGCTGGGCC AGGTCGCGGG GACGATGCGG GTGAGTGTCA ACGGGCGGAG CGCGGGCGAT
GACTGCGTCG CGACCCGAGC GCTGGACGTG CGTCCGCTCC TACGGGATGG CGCCAACCAC
GTTGAGATGG AGATCGCCAC GACCCTGGGA AACCGCCTGA TCGGACTCGG TAAGGCGGGC
GAGGAGGCCT ACGCGCGGTT CGCCAAACGG TCGCCCCGAC CGGCGGGCCT CATCGGGCCC
GTGTCACTGA GTGGGATCGC CGCCACGACG TGA
 
Protein sequence
MRRRHTTAGA RNGMLPIFGL GIAAWYDVRT RPLRTLAAVA GMIAAVGALM VVDAAGVLSR 
HGADDYLRRQ YGQPATFLIS GTGSSDARRQ TEADTLLTQG LDQAGFRTIS PYQLVRGITL
DGDHSAAVDT AVVDGSYPDV RVIDLVAGDF PTRAADRGVP RAVLEEGIAR QLGWEPHQIV
GRQLQYAVTG TAATIDTRNP RTMTVVVDAV MASTARGDSV PRLLLVGRLP PQLHSAAEDT
PGRWLARVAP ADTAYFQEVV ASTGRRLGSG SGFEATRLDQ DEALAPVIDQ QTRTAGIVAW
VAMAVGGLGM LGVGVASVRE RARELGVRRA LGASRATIFL GVLVETMFNV LVAAALAVPL
AALIVHLLPL PVQHGHRHIA LRRRPARIGH RSAPGPLERH HPAGRQHRRR RPSHPCLHAR
SRRDGGRSPR HRAGSCRPPG PDRGPERTRA GRVVAEPEPL GHRRGGLAAH GDGAAPSVHQ
RACRLADHPG ARRCGRCRYL YRTTVVLPAG WDTGVGAVLL ELGQVAGTMR VSVNGRSAGD
DCVATRALDV RPLLRDGANH VEMEIATTLG NRLIGLGKAG EEAYARFAKR SPRPAGLIGP
VSLSGIAATT
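
The sequences in this record are printed in space-separated groups of ten characters. A minimal sketch of how such a block might be joined into one string and its GC fraction computed is shown below. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input, so the printed value describes that line alone and not the whole gene.

```python
def clean_sequence(block: str) -> str:
    """Join a sequence printed in whitespace-separated groups into one string."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence from this record (sample input only).
first_line = "GTGAGGCGAC GGCACACAAC GGCCGGGGCC CGCAACGGTA TGCTGCCGAT ATTCGGCCTC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

The gene starts with GTG while the protein starts with M. Under translation table 11, GTG can act as a start codon, and an initiating codon is translated as methionine.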