Gene Franean1_0913 details


Gene Information

Locus tag: Franean1_0913
Symbol:
ID: 5669327
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1062862
End bp: 1063959
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 73%
IMG OID: 641239840
Product: hypothetical protein
Protein accession: YP_001505275
Protein GI: 158312767
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
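
The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a minimal sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(1062862, 1063959)
print(length)                     # 1098
print(protein_length_aa(length))  # 365
```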


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.471869
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.209889
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCTGC TGGCTGTTCT CGCCGGCGTG CTGCTGCTCG CCGGGAACGC GTTCTTCGTC 
GGCGCCGAGT TCGCGCTGAT CTCGGCCCGC CGCGACCGGG TCGAGCCGAT GGCGGAGGAC
GGCGACAAGC GGGCGGCGGC CGTCCTGTCG CACATGGAGC ACCTCTCCCC CATGCTCGCC
GCCACCCAGC TCGGCATCAC CGTGTGCTCG CTGGCCTTGG GCGCGGTCGC CGAGCCGGCG
GTCGCCCACC TCCTCGAGGC CGGGCTCGAG GCGGCGAACG TCCCCGTCGG CGCCCAGCAC
GCCATCGCCT TCGTGATCGC GCTGTGCATC GTGGTGTCGC TGCACATGGT GCTGGGCGAG
ATGGTCCCGA AGAACCTCTC GATCGCCGGC CCGGAGCGCG CGGCGCTCTG GCTGGGCCCG
CCGCTGTTCG CGTTCGCCCG GTTCACCCGG CCGTTCATCG CCTTCCTGAA CCACTTCGCG
AACGCGGTGC TGCGGCTGCT GCGGGTCACC CCGTCGGACG AGCTGACCTC CGCCTACACC
CCGGAGGAAC TGGGGGCGCT GATCGGGCAG TCCCGGCAGG AGGGCCTGCT GCCCGCCGGT
GAGCACGAGC TGCTCACGCA CGCGCTCGAA CTGTCCGGAC GGACCGTCCG GACAGTGATG
ATCCCGTTGT CGGAGATCGT CACCGTGCCG TGGACGGTCA CCGCCGCCCA GCTGGAGGAG
GCCGTGGCCG AAACGGGCTA CTCCCGGTTC CCCGTCCGTG CCCCGGGCCA GGACGCCGGT
CGCGAGCCAG GCGGTGGCGT GGGGCCCGTG GCGGAGCCGG CCGGCTTCCT GCACGCCAAG
GACGTCCTCG GTGTTCCCGA GCAGGAGCGC GACGAGCCGC TGCCGCCCCG CCGGCTGCGC
CGGATGGCCG AGATCGGGGT CGACCTGCAC CTGGACGAGG CGCTCCGCCT CATGCAGCGC
ACCAACAGCC ACCTGGGCCG GGCGGTGGAC GCGGCCGGCA CCACCCTGGG CGTCGTCGCC
ATGGAGGACG TCGTCGAGGA GTTCGTCGGC GAGGTGGAGG ACGCGAGCCA TCGCGAGACG
GCCGACCCCC GACCGTGA
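
The GC content reported in the gene information (73%) is the fraction of G and C bases in the gene sequence. A minimal sketch of that calculation follows; `gc_content` is an illustrative helper, not the method this record was produced with, and it assumes the sequence is pasted with the space-separated 10-base blocks shown above.

```python
def gc_content(dna: str) -> float:
    """Fraction of G and C bases, ignoring whitespace between blocks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Example on the last line of the gene sequence only (not the whole gene).
print(round(gc_content("GCCGACCCCC GACCGTGA"), 3))
```

Passing the full 1098 bp sequence to `gc_content` gives the value to compare against the reported figure.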
 
Protein sequence
MNLLAVLAGV LLLAGNAFFV GAEFALISAR RDRVEPMAED GDKRAAAVLS HMEHLSPMLA 
ATQLGITVCS LALGAVAEPA VAHLLEAGLE AANVPVGAQH AIAFVIALCI VVSLHMVLGE
MVPKNLSIAG PERAALWLGP PLFAFARFTR PFIAFLNHFA NAVLRLLRVT PSDELTSAYT
PEELGALIGQ SRQEGLLPAG EHELLTHALE LSGRTVRTVM IPLSEIVTVP WTVTAAQLEE
AVAETGYSRF PVRAPGQDAG REPGGGVGPV AEPAGFLHAK DVLGVPEQER DEPLPPRRLR
RMAEIGVDLH LDEALRLMQR TNSHLGRAVD AAGTTLGVVA MEDVVEEFVG EVEDASHRET
ADPRP
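
The protein sequence is the translation of the gene sequence under translation table 11. As a hedged sketch, the code below builds a codon table from the standard amino acid assignments, which table 11 shares with the standard code (the two differ in which codons may act as start codons), and translates until the first stop codon. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Amino acids for codons in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above.
print(translate("ATGAACCTGC TGGCTGTTCT CGCCGGCGTG"))  # MNLLAVLAGV
print(translate("GCCGACCCCC GACCGTGA"))               # ADPRP
```

The two fragments reproduce the first ten and last five residues of the protein sequence listed above.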