Gene Franean1_3809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3809 
Symbol 
ID5672173 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4523468 
End bp4524958 
Gene Length1491 bp 
Protein Length496 aa 
Translation table11 
GC content74% 
IMG OID641242688 
ProductNLP/P60 protein 
Protein accessionYP_001508108 
Protein GI158315600 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.100771 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.247532 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGACGT ACACCGCCGC TCAGATCTAC GCTTACGCCC GGGCCGCCGG CTTCACCTCG 
GACCAGGCCG TGACGATGAC CGCCGTCGCG CTGGCCGAGT CGGGCGGCAA CGGGATGGCG
CACGCCACCC GGGGCGAGGA CTCGCGCGGG CTCTGGCAGA TCAACATGAA CGCCCACAAG
CAATGGGCGC ACCTGGACGC CTCTGACCCC GCGGTCAACG CCCGGATGGC CTACGAGGTC
TCCCGCCAGG GCCGGGACAT CTCCCCCTGG ACGGTCACCC ACGGCGGCGC GGATGCCCGC
TACCTGGACT ACCGCGCGCA GGCGCAGCGG GCCGCGATCG AGGCCGGGGA CGCGGGCGCG
CAGGGAAACT GGTCGGGGAC CGAGGGCTAC GGGCACGCGG TCGCCGCCGG CGGGGGTGGG
AGTGGGGGCG ACCACACGGC CGGCTTCGCG GCGAGCGGGT ACCCGACGTA CTCGGCCGGC
ACCGGCGGGA ACGGCCCGGG CGGCGCGCAG GTCGGCGAGA CCACCCGGCA TTTCCTCGAC
GCCGCGCTCG CTCAGGCCGG CGACCGCTAC GTGTACGGCG CCGAGGCGCA GCTGAACGAC
GCGGACCCCG ACGCCTTCGA CTGCTCGGAG CTGACCCAGT GGGCGGCGGC GCAGGCCGGC
GTCGAGATCC CCGACGGCGC CGCGAACCAG TACGAGGCCC TGCGCACCGA GGGCCAGGAG
GTCTCCGTCG ACGAGGCCCT GCGCACGCCC GGCGCGCTGC TGTTCCACGC CGACGCGAGC
GGCTACGTCA GCCACGTCGC GATCAGCCTC GGTGACGGGC GGACGATCGA AGCCCGCAAC
TCCCGCCTCG GCGTCGGGGT GTTCGACGGA CGGGAGAACT GGCTCAACCG GGCGGCCGTC
CTCCCCGGAC TGTCCGGGGA GGTCGGCGCG GGCGCGTTCG CGGCCGGGCC CGGAACCGGC
CTCGGGGCCG GGGGGGCCGC TGACGGTACC GACAGCGACG CGGACGGGCT CACCGACGCC
CTGGAAGCGT CGCTCGGCAG CAACGCCCAT GAGGTGGACA GCGACAGCGA CGGCTTGTCC
GACTCCTACG AACTGCTGCG CGTGCACTCG GACGTGATGT CGGCGGACAC CGACCACGAC
GGTCTGGCGG ACGCGCTCGA TCTGGCCTCC GGCTTCGACC CGACGAACGC GGACAGCACC
GGCACCGGTC GTCTGGACGG GGCGACCGGC GAGGCGGCCA GCCTCGACAC CGACGACGAC
GGGCTCGCCG ACGCGCTCGA GCGGGTCCTC GGCACCGACT CGACTCTGGC CGACAGCGAT
CACGACGGGG TGACCGACGG CGCGGAGCAC CTGAACCGCA TGGACCCGCT CGACCTGAAC
GACGTGGGGT CGCTGGTGCA CAGCGCGGAC ACCGGCGCTG GGACGGGAAC CGACCTCGGT
GCCACCCACC CCCCGGGCGG CACCGACCTC GACGGCACCA CCCACCTCTG A
 
Protein sequence
MATYTAAQIY AYARAAGFTS DQAVTMTAVA LAESGGNGMA HATRGEDSRG LWQINMNAHK 
QWAHLDASDP AVNARMAYEV SRQGRDISPW TVTHGGADAR YLDYRAQAQR AAIEAGDAGA
QGNWSGTEGY GHAVAAGGGG SGGDHTAGFA ASGYPTYSAG TGGNGPGGAQ VGETTRHFLD
AALAQAGDRY VYGAEAQLND ADPDAFDCSE LTQWAAAQAG VEIPDGAANQ YEALRTEGQE
VSVDEALRTP GALLFHADAS GYVSHVAISL GDGRTIEARN SRLGVGVFDG RENWLNRAAV
LPGLSGEVGA GAFAAGPGTG LGAGGAADGT DSDADGLTDA LEASLGSNAH EVDSDSDGLS
DSYELLRVHS DVMSADTDHD GLADALDLAS GFDPTNADST GTGRLDGATG EAASLDTDDD
GLADALERVL GTDSTLADSD HDGVTDGAEH LNRMDPLDLN DVGSLVHSAD TGAGTGTDLG
ATHPPGGTDL DGTTHL