Gene Franean1_4197 details


Gene Information

Locus tag: Franean1_4197
Symbol:
ID: 5672552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4996976
End bp: 4998307
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 73%
IMG OID: 641243070
Product: hypothetical protein
Protein accession: YP_001508487
Protein GI: 158315979
COG category: [R] General function prediction only
COG ID: [COG3211] Predicted phosphatase
TIGRFAM ID:
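
The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the record above.
START_BP = 4996976
END_BP = 4998307


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1332 443
```

Both results match the reported Gene Length (1332 bp) and Protein Length (443 aa).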


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.017102
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.75785
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGTCA ACCGCCGTCA GGTGGTCGTG GGAACCGGCG CGGCCGGCCT GGGCTTCACC 
CTGTCGGGTG CGGTGAGCTC GGTGTTCGCC GGCACCGCGT CCGCCGCAAC ACCGAAGAAG
TTCGCCGGGT ACGGCGAGCT GGTCCCGGAT CCGAAGGGCC TGGTCGATCT TCCCTCCGGC
TTCCGGTACA CCGTGCTGTC CCGAGCCGGG GTCGACTCCC TGACCGGCGG GGGTGTGGTG
CCCGGCGCGC CCGACGGGAC GTACGCGTTC CCACTCGGCC CGGGCCGCAG CGTTCTCGTC
CGCAACCACG AGCTCTCCCC CGGAGGCACG GACCTCGTCC CGCAGCGGCC GGGCATCACC
TACGACCCGG CCGCCCCGGG CGGGACGACG ACCGTCACCG TCGCCGGTGA CCGGCTGCTG
TCGGCCGTGC CCAGCCTGGC CGGCACCATC CGCAACTGCG CCGGCGGCAA CACCCCGTGG
CGCACCTGGC TGTCCTGCGA GGAGACCGAG GACACCCCGG CGACCAACCC CGCGCTCACC
AAGCGGCACG GCTACGTCTT CGAGGTCGAC CCGTTCGGCC GGCTGCGTGA CCGCGAGGCG
GTGCCGCTGA CCGCGCTCGG CCGGTTCGCG CACGAGGCCG TCGCGGTCGA CCCCCAGTCG
GGCTGCCTCT ACCTGACCGA GGACGCGTCC AAGCCGTACG GCCTGATCTA CCGCTTCCTG
CCCCGCCGGC CGCTCGGCGG GCCCGGCAGC CTGCGGGCCG GCGGCAAGCT GCAGGCGCTG
CAGGTCCCCG GTGTCCCGGA CCTGTCCGCC ATCAGCGAGC TGCACACCAC GGTGCAACTG
TCCTGGGTCG ACGTCCCCGA CCCGGACGCC GCCACGGTGT CGACCCGCAA GCAGTTCGCC
GCCGGCAAGG TCACCCAGGT CCCGAAGGCC GAGGGCATCT TCTGGTCCGG GCGCTCCGCC
TACGTGGTCT CCAGCTACGC CAAGACCGCG GACGGCGCCG CCCGTGACCA CGCCGGCCAG
GTCTGGAAGC TCGACCCGAA GAAGGGCACC CTCGAGCTGG TGCTGCTGAT CGAGCCGGGC
GGCCGGTTCG ACGGCCCCGA CAACATCACC GTGTCGCCGG GCGGCGGCAT CGTGCTGTGC
GAGGACGGCG ACGGCGAGCA GCACCTGATC GCGCCGTCGG CCGAGGGTGT CCCCTACCCG
CTGGCCCGCA ACGCCACCAG CGAGAGCGAG TGGGCCGGCG CCACGTTCTC CGCGGACGGC
CGCTGGCTCT ACGCCAACAT CCAGAGCGAC GGCCTCACGG TGGCGATCAC CGGCCCGTGG
TGGCGCGGCT GA
 
Protein sequence
MAVNRRQVVV GTGAAGLGFT LSGAVSSVFA GTASAATPKK FAGYGELVPD PKGLVDLPSG 
FRYTVLSRAG VDSLTGGGVV PGAPDGTYAF PLGPGRSVLV RNHELSPGGT DLVPQRPGIT
YDPAAPGGTT TVTVAGDRLL SAVPSLAGTI RNCAGGNTPW RTWLSCEETE DTPATNPALT
KRHGYVFEVD PFGRLRDREA VPLTALGRFA HEAVAVDPQS GCLYLTEDAS KPYGLIYRFL
PRRPLGGPGS LRAGGKLQAL QVPGVPDLSA ISELHTTVQL SWVDVPDPDA ATVSTRKQFA
AGKVTQVPKA EGIFWSGRSA YVVSSYAKTA DGAARDHAGQ VWKLDPKKGT LELVLLIEPG
GRFDGPDNIT VSPGGGIVLC EDGDGEQHLI APSAEGVPYP LARNATSESE WAGATFSADG
RWLYANIQSD GLTVAITGPW WRG
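
The gene and protein sequences above can be cross-checked with a short sketch that strips the 10-base grouping, translates codon by codon, and computes GC content. It is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, and the codon table is built from the standard NCBI amino-acid string, which translation table 11 shares for internal codons (table 11 differs only in which codons may act as start codons, which does not matter here because the gene starts with ATG).

```python
BASES = "TCAG"
# Amino acids in NCBI order (first, second, third base each cycling through TCAG).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from the first base, stopping at a stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence above.
first_block = "ATGGCAGTCA ACCGCCGTCA GGTGGTCGTG"
print(translate(first_block))  # prints: MAVNRRQVVV
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content of 73%.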