Gene Franean1_7260 details


Gene Information

Locus tag: Franean1_7260
Symbol:
ID: 5675561
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8862213
End bp: 8864369
Gene Length: 2157 bp
Protein Length: 718 aa
Translation table: 11
GC content: 76%
IMG OID: 641246097
Product: hypothetical protein
Protein accession: YP_001511485
Protein GI: 158318977
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family
TIGRFAM ID:
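
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive and that the span includes the stop codon; the function name cds_lengths is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes inclusive, 1-based coordinates and a span that ends with a stop codon.
    """
    gene_len = end_bp - start_bp + 1  # both end positions are counted
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the final codon is the stop, not a residue
    return gene_len, protein_len


# Coordinates taken from the record above.
print(cds_lengths(8862213, 8864369))  # (2157, 718)
```

The result agrees with the 2157 bp and 718 aa values in the record.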


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.131718
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTGGCCG AGCCGGCCGG TGGAACCGAC CGGCCAGCCG GCCGAACCGC CCACGAGCTG 
GACATCCCGC CGATCCGGGG CGGCGGCGTG ACCCGCGGCG ACGGAGCGGT TCAGGCCGAC
TCGGCGGTCC AGGCCAGCGG GGCGGTTCAG GCCGTCGCGG TGATCAGCGG CGGCGTGGTG
GGGCCTGCGC CGGACACACG TGAAACAACC GGGCCGCGAG GCGCGGTCGA CATGCTGGCG
GTGGTCGTAG TCAGCCTGCT GGCGGTCGCG GTCCAGGCGG TGCTGTGGGA ACGCTTCCGC
CAGCCGCTCT GGTACGACGA GCTGTGGCGC CCGCACTTCG TCGCCGAGCC GCCGGGGACC
TTCTGGTCGG AGCTCTCGGT GGCGAACACG CCGTCCGCGA TCGGCTCGAT GGGTCTGCTC
CGGGTGTGCG GCGACGTGTT CGGGTGGCAT GCCTGGGCCC TGCGGCTACC CTCCGCCGTT
CCCCTCGTCG CGCTCGCCGC GGGCACCTGG CTACTGGCCC GCCGGCTGAC CGGACGCACA
GCGGCGTTCA CGGCGGCACT GACGGTCACG CTCGGCGGCA CCGTCGTGGA CCTCGCGTCC
CAGGTCAAGC CCTACACCCT CGACGCGGCC TGCGCGCTGG CCGTGGTCAT GCTCTGGATG
CGCGCCGCTT CCACCACCCG CGCGCTGCTG TGGAGCCGGT TCGCCGTCGG CGTCCTCGCA
CTGTTCTCCC TACCCGCGGT CTTCCTCATC GTCCCGCTGA CCGTCGTGGA CGTCGCCCGG
CCTCTGCACG GCGCCTATGG CTGGGCTGGC CGCCGCACGG CCGCCCGCCC AGCTGCTCGC
AGGGCCTTCC GTCCACCTGC TCGCGCGGCC GTTCGCGCCG CGCTGTGCGA CGCCCCCGCC
GTCGTGATCG CGGGGCTGCA CGCGGCGCTG TTCGTGTTGC ACCAGTCCGG CCAACGGGCG
AGCACGTTCT GGGACGACCA CTTCCTGGCC GGGCGCGGAC TGCCCGACGG GATCCGTTTC
GTCGCCGACC AGCTCGCGGC GACCGCGGCC GGCGCGCCCC CGGGCATCGA CCGCTATGAC
CCGAACCTGG TTCACCCGCT GACCGACTCG TCCGACCCGG CAGTGATCGT GCTTGGCTGT
GCCGTCGCGA TCACGTTCCT GGCCGGAGTG GTCACGCTCG CCCGGCGCCC GGATGGCCGG
GTGCTGCTGG TCGCGACCGG CGGCGCCGAG CTGATGGTGC TGGCGGCGAG CGCCGGGCGG
TACTGGCCGT TCGGCGCCGT GCGGACGAAC GTCTTCCTCG TCCCGCTGCT CACCGTGGTC
GCGGCCGCCG GAGCCGCCAC GCTGGCCCGC ACCGCCCGCG TCGCGTGGGC GGGAGCCGGG
GTCGCCGGCC GCCCCGGGGA CGGTCCTTCC GGTGGCGATC CCGCCGGTGG CCGCGCGGCC
GGTGGCCGCA CGGCCGGCTG GCAGCGCACA GCCGGCTGGC AGCGCACGGC GGCGGGCCTG
CTGGCAGCGG TGACGGCGGT GACCGTCCTC GCCGCGGCCG CGGTGCCCGC CGCCGCGGTC
AGCGCCCTGC ACCCACTGTG GGAGGAACGC GGCGACCGCC GTCCGATCGA CCTGATGGTG
GACGCGACCG TCACCGCCCG CCGGCTCTAC CGGCCCGGCG ACCTGGTCGT CGTAGGCGGC
CGGCTCGCCC GCGCGGGTTG GCTGTACGGG ATGGAGGTAA GCCAGGACGG CGCCTCCGAC
GACCGGACAC CCGGCCAGGA CGCAGCCGGC CAAAGTGCGG CTGGCCAGGA CGCGCCCGCC
GCGAACACAG CCGGTCAGGG CCCGACCCGT CAGGATGCGC CCGGTCAGGA ACGTCCCGGC
GGGCCGGGCG CGGATGTGGT GACCGGGCCG GTGGGACCGC GGGTGCCGCG CTCCTCGACC
GTGTTCCTGA CCGCGATCGG AGATGGCGGC GTCGGGCGGG CACTCACCCG GCGCGCCCCG
GGCCCGGAGG GGCGGGTGCT GCTGTTCGTC CTCGCCTACG ACCGCCGGGG CACGGGCCAC
TCGCTCGACG AGGCACGGGC CGCGGGCTGG TGCCCCGCCT GGACACACGA CTTCGAGCTC
ACCGGCACGC TGCGCGTCCT CACCCCGTGC GGGCACCGAG CCGACAACGC AGGCTGA
 
Protein sequence
MLAEPAGGTD RPAGRTAHEL DIPPIRGGGV TRGDGAVQAD SAVQASGAVQ AVAVISGGVV 
GPAPDTRETT GPRGAVDMLA VVVVSLLAVA VQAVLWERFR QPLWYDELWR PHFVAEPPGT
FWSELSVANT PSAIGSMGLL RVCGDVFGWH AWALRLPSAV PLVALAAGTW LLARRLTGRT
AAFTAALTVT LGGTVVDLAS QVKPYTLDAA CALAVVMLWM RAASTTRALL WSRFAVGVLA
LFSLPAVFLI VPLTVVDVAR PLHGAYGWAG RRTAARPAAR RAFRPPARAA VRAALCDAPA
VVIAGLHAAL FVLHQSGQRA STFWDDHFLA GRGLPDGIRF VADQLAATAA GAPPGIDRYD
PNLVHPLTDS SDPAVIVLGC AVAITFLAGV VTLARRPDGR VLLVATGGAE LMVLAASAGR
YWPFGAVRTN VFLVPLLTVV AAAGAATLAR TARVAWAGAG VAGRPGDGPS GGDPAGGRAA
GGRTAGWQRT AGWQRTAAGL LAAVTAVTVL AAAAVPAAAV SALHPLWEER GDRRPIDLMV
DATVTARRLY RPGDLVVVGG RLARAGWLYG MEVSQDGASD DRTPGQDAAG QSAAGQDAPA
ANTAGQGPTR QDAPGQERPG GPGADVVTGP VGPRVPRSST VFLTAIGDGG VGRALTRRAP
GPEGRVLLFV LAYDRRGTGH SLDEARAAGW CPAWTHDFEL TGTLRVLTPC GHRADNAG
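
The protein sequence can be checked against the gene sequence by translating it with translation table 11 (the bacterial code). The sketch below is a minimal illustration, not the tool that produced this record. It assumes table 11 uses the standard codon assignments and that an alternative start codon such as GTG is read as M in the first position; the helper names clean, gc_percent and translate are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed identical for table 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Start codons assumed for table 11; each is read as M in the first position.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq):
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(seq):
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still encode Met at position 1
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence above.
first_line = "GTGCTGGCCG AGCCGGCCGG TGGAACCGAC CGGCCAGCCG GCCGAACCGC CCACGAGCTG"
print(translate(first_line))  # MLAEPAGGTDRPAGRTAHEL
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to translate and gc_percent would allow the same comparison for the whole record.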