Gene Francci3_0285 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0285 
Symbol 
ID3903028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp330380 
End bp331486 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content71% 
IMG OID637877614 
ProductNLP/P60 
Protein accessionYP_479401 
Protein GI86739001 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0791] Cell wall-associated hydrolases (invasion-associated proteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGTCCGCGC AAAGCGTGCA GCTCGACGAA GAAACGCGAG GCGCCGGTCG CGGCCGTCAC 
CGCGCACCCT CCGCGCCCCC GGCCGCCCGC AGCCGGGCCC GGGCCCGCGC CTTCGCCGCC
GTGACCACCG GGACCGTCGC GGTCTCCGGA GTGGCTCTCG CCGGCTGCGC CACCGACATC
AACGCCGACG CCGCGAAGGA CGAGCAGCCG AACACCGCTC CGATCACCCT CGCCGCGCAG
ATCGGCTCCC AGGCCGCCCT GGGGGGCAGC ATCGCCGCCG CGGCGGTCAG CACCCACGGC
TCCACCTCCC TGCTCGGCTC GACGGGCGGT GTCGACGCCC CGCCCCTGGC CGGCAAGGTT
CAGGTCGGTC TCCGGGTGAC CAACCCCGAC GTGAGCGTCA GCGCGGACCA GCCGATCGAC
ATCGGCTTCT CGCTCTACAA CGAGCAGACC CACGAACCGC TGGCGAACCA GCTGGTCAAG
GTGCAGGTGA AGCTCCCCAC CGGGTGGGCC ACCTTCAAGC ACCTTTACAC CAACGCCCAG
GGCTACGCCT CCTACACGGC CCGGGTGCTC ACCACCACGA ATGTCACCGC GGTCTTCGAC
GGCACGGACG CCCTGCAGTC CGCCCGCTCG GCGAACGACG CCACCCTGCG CGTACGGCCG
GCGCCCACCC CCCGGCTCGT CCGCAACGCC GCCTGGTCGG ATCTCCTCAC GACAGGGGAC
GCCGCGGACC AGGCGTCGGT TCCGGTCCCG TCGAGCTCCC TCGGGGAGAA GGCCGTCTAC
CTGGCCTCCC TGCAGGCCGG CAAGCCCTAC GTCTACGGTG CGGAGGGCCC GAGCTCGTTC
GACTGCTCCG GCCTCATCCA GTACATCTTC AAGCAGCTCG GCAGGAGCGT GCCGCGGACG
ACCGACGCCC AGTTCGCCGC CGCCACCCGG GTGTCGCAGT ACAACAAACA GCCCGGCGAC
CTGATCTTCT TCGGGACACC CGGGAACATT TACCACGTCG GCATCTACGC GGGCGACGGC
ATGATGTGGG CGGCGCCGCA CACCGGCGAC GTCGTGTCGC TCAAGAAGAT CTATACCACC
TCCTACTACG TCGGTCGGAT CCTCTAG
 
Protein sequence
MSAQSVQLDE ETRGAGRGRH RAPSAPPAAR SRARARAFAA VTTGTVAVSG VALAGCATDI 
NADAAKDEQP NTAPITLAAQ IGSQAALGGS IAAAAVSTHG STSLLGSTGG VDAPPLAGKV
QVGLRVTNPD VSVSADQPID IGFSLYNEQT HEPLANQLVK VQVKLPTGWA TFKHLYTNAQ
GYASYTARVL TTTNVTAVFD GTDALQSARS ANDATLRVRP APTPRLVRNA AWSDLLTTGD
AADQASVPVP SSSLGEKAVY LASLQAGKPY VYGAEGPSSF DCSGLIQYIF KQLGRSVPRT
TDAQFAAATR VSQYNKQPGD LIFFGTPGNI YHVGIYAGDG MMWAAPHTGD VVSLKKIYTT
SYYVGRIL